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ABSTRACT 


An  algebraic  approach  is  developed  for  multiple  time  scale 
decomposition  of  a  linear  system  using  the  Smith  structure  of  the  system 
matrix  viewed  as  the  matrix  of  functions  of  a  small  parameter  c.  Hus 
derivation  makes  clear  that  both  the  necessary  and  sufficient  multiple 
semi-stability  (MSST)  condition,  which  ensures  well-defined  multiple 
time  scale  behavior  and  the  time-scale-de composed  system  structure  which 
approximates  the  original  system  are  closely  related  to  the  so-called 
Schur  complements  of  a  certain  matrix.  Furthermore,  this  decomposition 
has  been  extended  to  a  larger  class  of  systems,  satisfying  the  so-called 
aultlple  ffeni-rsiflEig  jiuUstructucs  (mssns)  condition. 

The  algebraic  approach  is  also  applied  to  examine  the  qiiestions  of 
the  feedback  control  of  the  linear  systems.  Specifically -we  present^ 
results  on  time  scale  modif ications  by  state  feedback.  '  ;  '77’ j 

>  X  /  fM  '■  ‘  ' -  -  ( 

0  Hie  characterization  of  the  relationship  among  the  eigenvalues  of 
A(c),  its  invariant  factors  and  the  MSSNS  and  MSST  conditions  has  been 
thoroughly  studied.  It  is  shown  that  the  MSSNS  condition  is  not  only 
equivalent  to  the  non-singularity  of  the  Schur  complements  of  certain 
matrix  but  also  equivalent  to  1)  the  eigenvalues  and  the  invariant 
factors  having  the  same  orders  and  2)  a  condition  which  exposes  the 
relationship  among  the  order  of  the  gcd  of  all  ixi  principal  minors,  the 
order  of  their  sum  and  the  invariant  factors.  The  MSST  condition  is 
equivalent  to  1)  the  Schur  complements  being  Hurwitz  and  2)  a  condition 
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which  reveals  the  connection  between  the  MSSNS  and  MSST  conditions  and 
the  eigenvalues  of  A(c). 

Using  the  algebraic  approach,  a  scaling  procedure  is  developed  to 
transform  a  system  having  no  uniform  time  scale  approximation  to  one 
which  does.  This  procedure  is  applied  to  high-gain  feedback  problems. 
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The  class  of  systems  considered  in  this  thesis  are  linear,  time- 
invariant  systems  whose  parameters  are  subject  to  a  small  perturbation. 
Mathematically,  this  can  be  stated  through  the  system  equation 

x(t)  =  A(e)x(t)  +  B(e)u(t)  (1.1) 

Here  x  and  u  are  n-  and  m-  dimensional  state  and  control  vectors 
respectively;  and  A(e),  B(e)  are  nxn  and  nxm  matrices  whose  entries  are 
analytic  at  e=0. 

As  pointed  out  by  Coderch  and  et.al.  [1],  [7],  under  certain 
conditions  the  undriven  system 

x(t)  =  A(e)x(t)  (1.2) 

will  exhibit  multiple  time  scale  behavior.  Namely,  the  state  variable 
x(t)  can  be  "approximated"  by  a  new  variable  z(t)  whose  components 
evolve  at  several  time  scales  with  time  constants  proportional  to 


1,1/e, 1/e*, .  In  other  words,  the  undriven  system  which  may  have 

very  large  dimension  can  be  broken  down  into  several  subsystems  with 
lower  dimensions,  each  of  which  focuses  on  different  time  scale. 

The  phenomenon  just  described  is  commonly  referred  to  as  a 
consequence  of  the  singular  perturbation  of  the  system  matrix  A(e). 
Specifically,  as  noted  by  Coderch  [71,  the  system  (1.2)  may  have  several 
time  scales  only  if  A(e)  losses  rank  as  e  goes  to  zero+.  If  the  rank  of 
A(0)  is  equal  to  the  rank  of  A(e),  then  A(e)  is  regularly  perturbed  and 
has  only  one  time  scale. 

The  value  of  singular  perturbation  analysis  for  the  system  (1.2) 
rests  on  the  fact  that  it  achieves  model  order  reduction  by  separating 

0 

the  system's  time  scales,  that  is  by  considering  slow  and  fast  phenomena 
separately.  Consequently,  problems  of  analysis  and  control  for  systems 
with  very  large  dimension  may  boil  down  to  several  problems  of  smaller 
dimension.  In  addition,  there  are  situations  in  which  the  original 
system  is  not  singularly  perturbed  but  in  which  the  control  which  is 
applied  causes  the  overal  closed-loop  system  to  possess  several  time 


scales.  For  example,  singularly  perturbed  system  can  result 
from  the  optimal  control  of  a  system  using  a  quadratic  cost  functional 
having  a  small  penalty  on  the  control  [2],  [13],  The  results  in  these 
references  reveal  the  existance  of  multiple  time  scales  in  such  systems. 

There  have  been  numerous  papers  dedicated  to  this  subject  (  see 
survey  (io],{23)  among  which  Kokotovic  et.al.  have  thoroughly  studied  the 
two  time  scale  case.  Then  Coderch,  et.al.  [1],  [7]  carried  this  idea  to 
the  multiple  time  scale  case  and  derived  some  basic  results.  In  this 
thesis  we  develop  a  new  algebraic  approach  to  multiple  time  scale 
analysis  which  allows  us  to  obtain  a  clearer  and  deeper  understanding  of 
time  scale  decomposition  for  the  general  systems  (1.1),  (1.2).  Not  only 
does  this  approach  allow  us  to  gain  more  insight  into  the  nature  of 
systems  with  several  time  scales  but  it  also  provides  a  framework  within 
which  it  is  possible  for  us  to  consider  and  solve  several  other 
important  problems,  in  this  chapter  we  first  briefly  describe  previous 
work  which  forms  the  foundation  on  which  our  research  is  built.  We  then 
give  the  outline  of  this  thesis  and  surnnarize  its  contributions. 


1.2  Background 

The  origin  of  the  multiple  time  scale  problem  can  be  traced  to  the 
so-called  boundary- layer  problem  in  ordinary  differencial  equation 
theory  where  a  small  positive  number  e  is  incorporated  to  allow 


perturbations.  Typically  such  a  problem  gives  rise  to  a  boundary  layer, 
which  is  a  narrow  interval  of  time  close  to  the  origin  where  the 
solution  of  the  differential  equation  changes  rapidly.  The  thickness  of 
the  boundary  layer  approuches  zero  as  e — >0.  Outside  the  boundary 
layer,  in  the  outer  region,  the  solution  varies  slowly.  Therefore  the 
system  presents  two  time  scales.  A  simple  example  will  show  this 
phenomenon.  Consider  the  perturbed  differential  equation 

ex(t)  +  (l+c)x(t)  +  x(t)  =  0,  x(0)=0,x(l)=l 

The  exact  solution  of  this  problem  is 


x(t)  * 


2-t  -  e-t/e 
B-1  _  P-Ve 


Therefore  this  problem  exhibits  two  time  scales  having  time  constants  of 
order  1  (slow)  and  order  e  (fast).  In  general,  the  boundary  layer 
method  is  based  on  the  fact  that  if  a  solution  of  a  differencial 
equation  is  slowly  varying  except  in  isolated  boundary  layers,  then  it 
may  be  easy  to  obtain  a  leading  order  approximation  without  directly 
solving  the  equation. 

Kokotovic  and  co-workers  studied  a  special  class  of  systems  of 
singular  perturbed  linear  differential  equations  [4],  [5] 


*l(t)  -  A-|  i  x-|  (t)  +  A-;  (t) 

ex 2 (t)  =  A21xl^  +  A?2x? (t) 

If  we  change  the  argument  t  to 
t  =  t/e 
then  we  have 


(1.3) 


l/ex^t)  =  A^jXj^t) 


+  Ai2X2(c) 


X2(c)  =A21x1(c) 


+  A22x2 (c) 


Or 


I 

x= 


cAn 

A21 


^12 

A22 


x  =  A(e)x 


(1.4) 


Thus,  (1.3),  or  equivalently  (1.4)  is  a  special  case  of  (1.2).  If  in 
addition  A22  and  A11-A21A22-1A^2  have  all  their  eigenvalues  strictly  in 
the  left-half  plane,  then  system  (1.2)  has  well-defined  two  time  scale 
behavior  (see  below)  and  the  eigenvalues  of  A(e)  will  fall  in  two  groups 
as  e  approaches  zero,  one  of  order  1  the  other  of  order  e. 

In  the  previous  subsection  we  gave  a  verbal,  intuitive  definition 
of  what  we  mean  by  well-defined  multiple  time  scale  behavior,  namely 
that  the  system  can  be  decomposed  into  several  subsystems,  one  at  each 
time  scale.  The  following  is  a  precise  statement  of  what  we  mean. 

Definition: 

System  (1.2)  has.  well-defined  multiple  time  scale  behavior  if  there 
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exist  constant  matrices  Ag,  Ap....,  A^,  T  and  integers  kgS-k^S-Sk^i 
such  that 

lim  sup  |  |exp{A(e)t} 
e->0  t20 

-  Texp{diag[ek0Ag,€^1A;i/...€^"1An_.1]}T"1|  I  =  0 

In  this  case  we  will  frequently  say  that  [Ag,....,An_i'T]  defines 
a  time  scale  deoonposition  of  (1.2). 

It  is  clear  from  the  definition,  that  if  (1.2)  has  well-defined 
time  scale  behavior  then  the  state  variable  x(t)  can  be  approximated 
(after  a  linear  transformation)  by  n  components,  each  evolving  at  a 
different  tiirte  scale.  Furthermore,  the  approximation  is  uniformely 
valid  on  entire  half-line  (0,*>)+. 

While  Kokotovic  considered  the  two  time  scale  case,  Coderch,  et. 
al.  [1],  [7]  presented  the  first  oomplete  solution  of  the  general  case 
(1.2).  In  this  work  they  present  necessary  and  sufficient  conditions 
for  the  system  (1.2)  to  have  well-defined  time  scale  behavior.  Their 
proofs  suggest  a  procedure  for  extracting  and  displaying  the  multiple 

+  As  point  out  by  Coderch  [7],  exp{A(0)t}  is  a  good  approximation  for 
exp{A(e)t}  on  any  finite  interval  [0,T].  Therefore  the  notion  of 
multiple  time  scale  makes  sense  only  when  the  concern  is  the  whole 
interval  [0,»). 


time  scale  structure  of  (1.2).  However  this  method  is  quite  involved 
and  does  not  lend  itself  to  easy  interpretation  or  computation.  For 
example,  the  relationship  between  the  complex  results  of  [1],  [7]  and 
intuitively  simple  results  of  [4],  [5]  is  not  at  all  apparent.  Making 
clear  this  relationship  and  obtaining  a  conceptually  and  computationally 
simple  solution  in  the  general  case  are  two  of  the  objectives  achieved 
in  this  thesis. 


1.3  Outline  of  This  Thesis 

The  first  portion  of  the  research  described  in  this  thesis  deals 
with  an  algebraic  approach  to  the  time  scale  decomposition  of  (1.2). 
Specifically,  in  Chapter  2  we  consider  the  Smith  form  of  A(e)  over  the 
ring  T  of  all  functions  of  c  which  are  analytic  at  e=0,  and  based  on 
this  form  we  are  able  to  obtain  a  more  direct  and  simple  description  of 
the  multiple  time  scale  decomposition.  This  derivation  makes  clear  that 
both  the  necessary  and  sufficient  multiple  semi-stability  (MSST) 
condition,  which  ensures  well-defined  multiple  time  scale  behavior  and 
the  time-scale-decomposed  system  structure  which  approximates  the 
original  system  are  closely  related  to  the  so-called  Schur  complements 
of  a  certain  matrix.  In  doing  this  we  also  are  able  to  make  clear  the 
connection  between  the  general  results  of  Coderch  et.  al.  and  those  of 
Rokotovic.  Furthermore,  having  established  this  framework  we  are  able 
to  use  it  to  solve  several  additional  problems.  In  particular  we  are 


able  to  extend  our  decomposition  (with  a  modified  notion  of  well-defined 
time  scale  behavior)  to  a  larger  class  of  systems,  satisfying  the  so- 
called  multiple  semisimple  nullstructure  (MSSNS)  condition,  and,  at  the 
end  of  Chapter  2,  we  use  our  results  to  examine  the  questions  of  the 
feedback  control  of  the  system  (1.1).  Specifically  we  present  results 
on  time  scale  modification  by  state  feedback. 

Chapter  3  deals  with  the  characterization  of  the  relationship  among 
the  eigenvalues  of  A(e),  its  invariant  factors  and  the  MSSNS  and  MSST 
conditions.  It  is  shown  in  this  chapter  that  the  MSSNS  condition  is  not 
only  equivalent  to  the  non-singularity  of  the  Schur  complements  of  a 
certain  matrix  but  it  is  also  equivalent  to  1)  the  eigenvalues  and  the 
invariant  factors  having  the  same  orders  and  2)  a  condition  which 
exposes  the  relationship  among  the  orders  of  the  gcd  of  all  ixi 
principal  minors,  the  order  of  their  sum,  and  the  invariant  factors. 
These  results  provide  us  with  a  procedure  for  computing  the  orders  of 
the  eigenvalues  and  invariant  factors  and  for  checking  the  MSSNS 
condition.  Also,  in  this  chapter  we  show  that  the  MSST  condition  is 
equivalent  not  only  to  the  Schur  complements  of  a  certain  matrix  being 
Hurwitz  (Chapter  2)  but  also  to  a  condition  on  the  orders  of  the  real 
and  imaginary  parts  of  the  eigenvalues  of  A(e),  which  in  turn  reveals 
the  connection  between  the  MSSNS  and  MSST  conditions. 

In  Chapter  4  we  use  our  results  on  the  relationship  between  the 
orders  of  eigenvalues  and  invariant  factors  to  explore  the  use  of 
amplitude  scaling  to  transform  a  system  matrix  without  MSSNS  into  one 


that  does  have  this  property.  The  analysis  involved  in  this 
investigation  is  rather  delicate  as  it  involves  careful  examination  of 
the  orders  of  principal  minors  and  the  identification  of  key  elements  of 
the  matrix  that  must  be  scaled.  The  end  result  of  our  efforts  is  a 
procedure  for  determining  such  a  scaling  matrix  for  systems  satisfying 
certain  conditions.  We  then  apply  our  results  to  interpret  and 
generalize  recent  results  on  time  scale  analysis  of  high  gain  or  nearly 
singular  optimal  feedback  systems. 

Finally,  in  Chapter  5  we  briefly  summarize  the  main  results  of  the 
thesis  and  discuss  several  directions  for  further  research. 


1.4  Main  Contributions  of  This  Thesis 

We  feel  the  main  contributions  of  this  thesis  are  the  following. 

1.  We  present  a  simpler  and  clearer  picture  of  the  multiple  time 
scale  decomposition  of  a  general  perturbed  linear  system  based  on  an 
algebraic  approach  which  allows  further  development  in  several 
directions. 

2.  We  make  clear  the  connection  between  Kokotovic‘s  explicit 
approach  for  the  two  time  scale  decomposition  case  and  Coderch's 


elaborate  multiple  time  scale  results. 

3.  We  thoroughly  study  the  MSSNS  and  MSST  conditions  relating  them 
to  the  algebraic  structure  of  A(e).  By  exposing  the  important  role  that 
the  orders  of  the  invariant  factors  and  eigenvalues  play,  the 
interpretation  of  those  conditions  is  clarified. 

4.  We  develop  an  algorithm  to  extend  time  scale  decompositions  to  a 
larger  class  of  systems  which  satisfy  MSSNS  but  not  MSST. 

5.  We  present  results  on  time  scale  assignment  through  state 
feedback  control. 

6.  We  make  clear  the  relationship  between  the  MSSNS  condition  and 
the  MSST  condition.  Specifically,  A(e)  satisfies  the  MSST  condition  if 
and  only  if  it  satisfies  the  MSSNS  condition  and  the  orders  of  the  real 
parts  of  its  eigenvalues  are  equal  to  or  less  than  those  of  the 
corresponding  imaginary  parts. 

7.  We  reveal  the  role  of  the  gcd's  of  the  principal  minors  (not  all 
minors  as  in  the  general  case)  in  determining  the  invariant  factors  of  a 
system  with  MSSNS  and  develop  an  algorithm  similar  to  that  of  the  so- 
called  Newton  polygon.  We  show  that  if  the  system  has  well-defined 
time  scale  behavior  (or  more  generally  if  A(e)  just  has  MSSNS)  then  this 
algorithm  determines  the  orders  of  the  various  time  scales  and  the 
dimensions  of  the  subsystem  at  each  time  scale. 


8.  We  develop  a  procedure  for  amplitude  scaling  to  transform  a 


system  which  does  not  satisfy  the  MSSNS  condition  to  a  system  which 
does.  If  the  original  system  satisfies  some  conditions  studied  in  this 
thesis,  then  after  scaling  the  resulting  system  will  have  well-defined 
multiple  time  scale  behavior. 

9.  We  apply  the  scaling  procedure  developed  in  this  thesis  to  high- 
gain  feedback  problems,  leading  to  an  interpretation  and  generalization 
of  results  in  the  literature. 


CHAPTER  2 


ALGEBRAIC  ASPECTS  OF  TIME  SCALE  BEHAVIOR 


2.1  INTRODUCTION 

As  we  pointed  out  in  Chapter  1,  the  system  we  shall  consider  in 
this  thesis  is  a  perturbed,  linear,  time-invariant  system 

x(t)  =  A(e)x(t)  +  B(e)u(t)  (2.1) 

and  its  undriven  form 

x(t)  =  A(e)x(t)  (2.2) 

where  A(e)  and  B(e)  are  nxn  and  nxm  matrices  whose  entries  are  functions 
of  a  small  parameter  e  analytic  at  e=0.  Kokotovic  and  oo-worker  [6,  10] 
have  thoroughly  studied  a  special  case  of  (2.2)  in  the  two  time  scale 
case,  which,  as  we  discussed  in  Chapter  1  Eq.  (1.4),  corresponds  to 
A(e)  having  the  special  form 


with  ah“a12a22  a21  Hurwitz 


Coderch  et.  al.  [7,1]  studied  the  general  case  of  (2.2)  and  derived 
necessary  and  sufficient  conditions  for  the  system  (2.2)  to  have  a  well- 
behaved  multiple  time  scale  description.  In  Section  2.2  we  give  a  more 
detailed  review  of  their  work  and  point  out  the  limitations  of  their 
results  which  have  motivate  our  work. 

In  particular,  in  this  chapter  we  develop  an  algebraic  approach  for 
determining  the  multiple  time  scale  structure  of  (2.2).  This  method 
involves  the  examination  of  the  Smith  form  of  A(e)  over  a  particular 
local  ring  T.  In  Section  2.3  we  give  the  definition  and  basic  feature 
of  this  ring.  In  Section  2.4,  we  introduce  the  so-called  explicit  form 
of  the  perturbed  system  (2.2)  which  is  closely  related  to  the  Smith  form 
of  A(e).  We  also  show  that  after  some  invertible  linear 
transformations,  the  solution  of  this  explicit  form  is  a  good 
approximation  (in  an  asymptotic  sense)  to  that  of  system  (2.2)  if  in 
fact  the  original  system  has  well-defined  multiple  time-scale  behavior  ( 
a  property  we  define  in  the  sequel).  Moreover,  using  the  explicit  form 
we  can  define  a  straightforward  procedure  to  check  if  a  system  has  well- 
defined  time  scale  behavior.  At  the  end  of  the  section  we  provide  an 
overview  of  the  major  results  along  these  lines  that  are  developed  in 


the  following  sections. 


In  Section  2.5,  we  give  a  first  derivation  of  the  multiple  time 
scale  approximation.  This  derivation  is  in  fact  an  extension  of  the 
usual  two  time  scale  argument.  Then  in  Section  2.6  a  proof  of  the 
asymptotic  properties  of  this  approximation  is  given  based  on  making 
explicit  the  connection  with  Coderch's  results.  It  turns  out  that  the 
time-scale-seperated  system  which  approximates  the  original  system  is 
determined  by  a  sequence  of  Schur  complements  of  the  system  matrix  in 
explicit  form.  A  major  consequence  of  this  is  that  the  computational 
procedure  we  derive  is  far  more  transparent  than  that  proposed  in  [7], 
The  eqi valence  between  the  results  obtained  in  Section  2.5  and  2.6  is 
established  in  Section  2.7.  This  development  makes  clear  the 
relationship  between  Kolotovic's  two  time  scale  result  and  Coderch's 
multiple  time  scale  result.  Finally,  in  Section  2.8  we  review  our 
results  in  order  to  place  our  contribution  in  its  proper  perspective. 
Specifically  our  approach  establishes  a  framework  that  not  only  exposes 
the  essential  nature  of  time  scale  decompositions  in  a  very  clear 
fashion,  thereby  improving  our  understanding  of  such  decompositions,  but 
also  provides  a  starting  point  for  posing  and  solving  a  variety  of 
problems  that  are  not  so  easily  posed  or  solved  using  previously 
developed  approaches.  Chapters  3  and  4  contain  several  important 
results  of  this  type,  as  does  the  end  of  Section  2.8  in  which  we  solve 
several  problems  including  one  that  had  been  proposed,  but  not  solved, 
by  Coderch  [7]. 


2.2  PREVIOUS  WORK 


As  pointed  out  in  Section  2.1/  not  only  is  our  work  closely  related 
to  Kokotovic's  two  time  scale  results  and  Coderch's  miltiple  time  scale 
approach,  but  it  also  establishes  the  clear  and  simple  relationship 
between  these  earlier  results.  Therefore  at  the  start  it  is  important 
that  we  describe  their  work  in  some  depth.  In  this  section  we  give  an 
outline  of  their  work  and  point  out  several  issues. 


2.2.1  Two  Time  Scale  Results 


The  two  time  scale  singular  perturbation  method  developed  by 
KokotoviC/  et.al  [6.10]  is  based  on  a  linear  time  invariant  system  with 
a  small  constant  c  on  the  left  hand  side  of  its  state  equation: 


"X1  _  ^11x1+A12x2* 

^11  A12 

X1 

=  A 

*1 

-ex2J  La21x1+a22x2- 
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There  are  two  well-known  results  about  this  system  for  small  c. 

1.  If  A22  and  A11=A11-A12a22~1a21  are  non-singular,  then  the 
eigenvalues  of  (2.3)  fall  in  two  groups  as  e — >0.  One  group  approaches 
the  eigenvalues  of  A^.  The  other  approaches  that  of  eA22.  The  matrix 


\ 


is  commenly  referred  to  as  the  Schur  complement  of  A22  in  the  matrix 
A  [3],  We  will  have  much  more  to  say  about  Schur  complements  as  we 
develop  our  approach  in  following  sections. 

2.  If  A22  and  A^  are  Hurwitz,  then  the  system  has  "well-defined 
time-scale  structure"  as  e — >0.  The  fast  time  scale  is  of  order  1  and 
the  slow  time  scale  of  order  1/e. 

Mathematically,  what  "well-defined  time-scale-structure"  means  can 
be  explained  as  follows. 


Define 


and 


Yl(t) 

y2<t) 


=T 


X]/t) 

x2{t) 


(2.4) 


Then  under  the  condition  that  A22,  A-q  are  Hurwitz,  we  have 


ri^n 


fys^ 


+  0(e), 


yf  (t/e) 


0<t 


(2.5) 


where  the  0(e)  term  is  bounded  uniformly  in  t  on  [0,*»)  as  e — >0  and 


ys(t)  and  (t/e)  are  called  the  "slow"  and  "fast"  states  respectively 
and  satisfy 


ys(t)  =Anys(t),  ys(0)  =  x^O) 

Yf(t)  =  (t) ,  yf (0)  =  X2(0)+A22  ^2^x^(0)  (2.6) 


If  we  define 


A(e)  = 


A11  a12 

e  e  *A22 


in  view  of  (2.4),  (2.5)  and  (2.6),  we  have 


lim  sup  |  |Texp{A(e)t}'Ir’1  -  exp{diaglAu,  l/eA22]t}  1 1=0 
e->0  t20 


Furthermore,  if  we  change  the  time  scale  to  x  =t/e,  we  have 


^(c)- 

=S 

,eAll  ^12" 
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Then  similarly, 

lim  sup  |  |Texp{A(e)c}T“3  -  exp{diag  [eAllf  A22]t}||=0  (2.7) 
e->0  c20 


In  general,  as  we  defined  in  Chapter  1,  if  A(e)  is  analytic  at  e=0 


and  Hurwitz  for  e>0  and  there  exist  constant  matrices  A^,  A2,  T  such 
that  uniform  convergence  as  in  (2.7)  holds,  then  we  say  that  A(e)  has 
well-defined  two  time  scale  behavior.  Obviously  Kokotovic's  results 
provide  a  two  time  scale  decornposition  of  a  linear  system.  Moreover,  as 
mentioned  earlier,  this  approach  has  been  proven  to  be  useful  in  optimal 
control  [4],  stochastic  control  18],  design  of  nonlinear  regulators  [5], 
analysis  of  high  gain  systems  113],  cheap  control  [2,  12,  13]  and  so  on. 
The  limitations  of  this  method  consist  of  two  points.  First,  the 
assumed  form  of  Eq.  (2.3)  essentially  implies  that  the  seperation  of 
fast  and  slow  state  variables  has  been  done  beforehand.  This  is, 
however,  not  the  general  case,  especially  for  oonplex  systems,  where  the 
system  matrix  could  be  a  more  general  function  of  e,  and  state 
seperation  would  not  be  immediately  available.  Secondly,  only  two  time 
scales  are  considered  in  this  model.  In  the  next  subsection  we 
introduce  the  result  of  Coderch,  et.al  [7,  1]  which  successfully  solve 
the  general  problem  of  multiple  time  scale  behavior  of  a  perturbed 
linear  system. 


2.2.2.  Multiple  Time  Scale  Behavior  of  Singularly  Perturbed  LIT  Systems 

The  system  considered  in  17,1]  is  the  singularly  perturbed  LTI 
system  (2.2)  with  a  slight  change  in  notation  whose  purpose  will  become 
clear  shortly. 


x(t)  *  AQ(e)x(t) 


(2.8) 


where  the  nxn  matrix  Aq(c)  is  an  analytic  function  of  e.  For  the 
present  discussion  we  assume  that  Ag(e)  is  Hurwitz  for  e£  (0,Cq)  although 
Cocerch  allows  slightly  weaker  assumptions.  In  Section  2.8  we  will  show 
how  our  approach  can  be  easily  extended  to  this  more  general  case. 

In  order  to  present  the  results  in  [7,1],  we  need  several 
definitions.  A  matrix  H  is  said  to  have  semisimple  nullstructure  (SSNS) 
if  its  zero  eigenvalue  has  geometric  multiplicity  equal  to  its  algebraic 
multiplicity.  In  other  words,  every  zero  eigenvalue  of  H  corresponds  to 
a  distinct  independent  eigenvector.  A  matrix  H  is  said  to  be  semi  stable 
if  it  has  SSNS  and  all  its  non-zero  eigenvalues  have  negative  real 

parts.  Suppose  Aq (e)  has  eigenvalues  A^e) . ,A n(e)  where  Ai(e)->0, 

e->0,  i=l,....,m£n.  Then  the  total  projection  for  the  zero-group  of 
eigenvalues  of  AQ(e),  Pq(c)  is  the  projection  onto  the  subspace  spanned 
by  eigenvenctors  corresponding  to  A^e),...., Am(e)  of  A0(e)  [9]. 

Since  Ag(e)  is  analytic  at  e=0,  it  has  a  series  expansion  of  the 

form 

A0(e)  =£  ePF0p 
P=0 

It  can  be  proven  [7,1]  that  if  Fqq  has  SSNS,  the  matrix 
A^e)  =  P0(e)A0(e)/e 
has  a  series  expansion  of  the  form 


28 


AX(e)  =ZePFlp 

p=0 


If  P10  also  has  SSNS  we  define  A2(e)  as 

A2(e)  =  P1(c)A1(e)/c  =  P^OPoCejAofeJ/e2 
P=0 

where  P-^e)  is  the  total  projection  for  the  zero-group  of  eigenvalues  of 
A1(c).  This  process  can  be  continued  but  it  terminates  at 

Ajute)  =  Pm-i^Aj^eJ/e  =  P^te)  ....PQteJAoCe)/^ 

p=0 

if  the  matrix  Fm0  does  not  have  SSNS  or  if 
rankF0o+rankF10+. . .+rankFm0=^i. 

A  matrix  Ag(e)  is  said  to  satisfy  the  multiple  semis iiqple  null 
structure  (MSSNS)  condition  if  the  sequence  of  matrices  Ak(e)  can  be 
contructed  up  to  a  stage  k=m  with  all  matrices  F^g,  k=0,....,m  having 
SSNS  and  rankFgg+....+rankFmg=n.  If  in  addition,  all  Fk0  are 
semistable,  then  we  say  that  Ag(e)  satisfies  multiple  semistability 
(MSST)  condition. 

The  following  results  determines  when  an  asymptotic  approximation 
to  x(t),  uniformly  valid  for  t*0,  can  be  constructed  which  clearly 
displays  the  multiple  time  scale  behavior  of  x(t).  The  main  result  is 
the  following. 


1.  System  (2.8)  has  well-defined  time-scale  behavior  if  and  only  if 
Afl(e)  satisfies  MSST  condition. 

2.  If  AQ(e)  satisfies  the  MSST  condition.  Then 

lim  sup  ||exp{A(e)t}  -  exp{ 
e->0  t20 

Furthermore,  it  can  be  shown  that  using  a  linear  transformation  T,  which 
is  independent  of  e  ,  we  have  that 

l*-  F^qT  =  diag{0,0, . . . ,0,^.0, . . . ,0},  k=0,»..,m 

where  each  of  the  A^  is  Hurwitz.  Therefore 

lim  sup  I  |exp{Ao(e)  t}  -  T_1exp{  £  TFifn^e^tjTl  |  =0 
e->0  tj>0  k=0 

or 

lim  sup  exp{lA0(e),r1}  -  diag{exp[A0t],....,exp[Amemt]}  =0 

(2.10) 

In  other  words,  to  first  order  approximation,  the  original  system  (2.8) 
can  be  thought  of  as  being  conposed  of  (mfl)  uncoupled  subsystems  (after 
certain  e-independent  transformations) 


E/kO^J  n  =  o 

k=0 


<fyk(t)/dt  =  A^y^t)  ,  k=0,....,m 

each  running  at  a  different  time  scale.  In  this  case  we  will  say  that 
(If  ioi— £»>  defines  a  time  scale  decomposition  sf  12>S). 

Prom  this  result  it  can  be  seen  that  in  order  to  obtain  the 
uncoupled  approximation  one  must  first  compute  F^q,  k=0,....,m,  then 
compute  T.  Although  a  procedure  was  proposed  in  [7],  it  really  should 
not  be  viewed  as  an  algorithm,  since  the  computational  aspects  of  the 
procedure  have  not  been  examined,  and  the  procedure  is  quite  involved. 
For  example,  F10,  F20,  F30  030  obtained  by 

F10  =  P0A01P0 

F20  =  plp0 (*02"*01*00**0l)p0Pl 

f30  =  P2P1P0(*03"*01*00#*02”*02*00#*01 

+A01*00#*01*00#*01”*02*10#A02 

A  A  A  4 

+*02*10  *01*00  *01"*01*00  *01*10  *02 

”*01*00  **01*10**01*00**01 )  PC^1P2 

where  P^limP^e),  i=0,l,2,  and  A*  is  the  pseudo- inverse  of 
A  €-*o 

matrix  A.  The  computation  of  F4q  is  much  more  complicated.  The 
complexity  of  these  formulas  makes  it  difficult  to  obtain  a  deep 
understanding  of  multiple  time  scale  behavior  and  the  structure  of  Aq(c) 
or  to  examine  a  variety  of  problems  such  as  the  consideration  of  the 
effect  of  control  on  time  scale  behavior. 


Having  these  background  results  we  are  now  in  a  position  to 


introduce  our  main  results.  As  we  mentioned  in  Chapter  1  and  earlier  in 
this  chapter,  an  important  element  in  our  development  is  the  Smith  form 
of  A(e)  over  a  local  ring  T  of  all  functions  of  e  analytic  at  e=0.  In 
next  section  we  give  the  definition  and  the  major  properties  of  this 


2.3  The  Local  Ring  T 


In  this  section  we  study  the  ring  T  of  all  functions  of  real 
variable  e  which  are  analytic  at  e=0.  In  other  words,  we  study  the  ring 
of  functions  which  have  Taylor  series  expansions  at  e=0.  We  show  that 
this  ring  have  a  Smith  form.  It  is  this  Smith  form  that  plays  an 
important  role  in  this  thesis.  We  state  many  of  the  results  ooncering  T 
and  matrices  over  T  without  proof  because  they  are  immediate  extensions 
or  examples  of  results  which  can  be  found  in  literature  [3,11]. 


Consider  a  set  T  which  consists  of  all  functions  of  e  which  are 
analytic  at  e=0.  It  is  easy  to  show  that  T  is  a  ring.  The  units  of  T 
are  elemlnts  of  T  which  do  not  vavish  at  e=0, 


u  =  {  x |  xer,  x(0)Ho} 


Therefore  any  unit  of  T  has  the  form 


Mtlii 


with  aQ\0.  Define  the  degree  of  x,  a(x),  to  be  the  order  of  the  first 
nonzere  term  in  its  Taylor  expansion.  (For  example,  if  x=a2e^+a3G^+«»* 
and  a2^0,  then  the  degree  of  x  is  two).  Then  T  is  a  Euclidean  domain 
with  degree  function  a{x).  If  for  some  x,  a(x)=i,  then  x  equals  e1 
times  a  unit.  In  other  words,  each  x  is  equivalent  (modulo  units)  to 

one  of  the  element  of  set  {l,e,e^, . }.  Let  f  and  g  be  two  elements 

of  T.  Then  it  is  easy  to  see  that 

1.  f  divides  g  iff  a(f)^a(g). 

2.  f  and  g  are  coprime  iff  at  least  one  of  them  is  not 
equal  to  zero  at  e=0. 

Let  M  (T)  denote  the  set  of  matrices  whose  elements  belong  to  T.  Then 
the  set  of  unimodular  matrices  is  defined  as 

U(T)  =  {G|G(M(T),  |G  (0)1^0} 

Because  T  is  a  Euclidean  domain,  it  is  a  principle  ideal  domain. 
Therefore  any  matrix  AfM(T)  can  be  transformed  to  its  Smith  form. 

Namely  for  any  matrix  AfM(T)  we  have 

A=PDQ 

where  P,Q«U(T)  and  D  has  the  form 

D  =  diag  {e1^,....,  eir,  0,....,0  } 


<i 


r* 


where  the  integers  ij  are  ordered  so  that 


•  ••• 


The 


\r* 


uy.  v.v.vivv  K~  <."  K.~  J.  VV*V  V~J  -rz  V-.  vry  vy  v.x;- 


quantities  e1!,  ....  are  called  the  invariant  factors  of  A(e).  The 
elementary  row  and  column  operations  used  in  bringing  A  into  its  Smith 
form  have  the  form 

1.  Interchange  of  any  two  columns  (or  rows) . 

2.  Addition  to  any  column  (row)  of  any  other  column  (row) 
multiplied  by  an  element  in  T. 

3.  Scaling  any  column  (row)  by  any  element  in  U(T). 

Let  A,  B  €  M(T)  have  the  same  number  of  rows.  Then  A  and  B  are  left 
coprime 

iff  F=  [A  B]  has  the  Smith  form  P[I  0]Q. 
iff  F  has  full  row  rank  at  e=0. 

It  can  be  shown  that  this  ring  is  a  local  ring  because  its  maximal 
ideal,  namely  <e>  is  unique. 

As  pointed  out  in  Chapter  1,  system  (2.2)  can  be  approximated  by 
its  explicit  form.  Further  work  on  our  algebraic  approach  is  bassed  on 
this  form.  Therefore  in  next  section  we  derive  the  explicit  form  of 
(2.2)  using  the  Smith  form  of  A(e). 
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2.4.  THE  EXPLICIT  FORM  FOR  A  SINGULARLY  PERTURBED  AUTONOMOUS 


LINEAR  SYSTEM 


2.4.1.  Introduction 

In  this  section  we  show  how  a  general  system  {22),  with  A(e)  being 
a  matrix  over  T,  can  be  put  into  what  we  term  its  explicit  form.  As 
mentioned  before,  it  is  the  explicit  form  that  makes  it  possible  to 
connect  the  Smith  form  of  A(e)  with  the  multiple  time  scale  behavior  of 
the  system.  This  provides  considerable  insight  into  the  structure  of 
such  systems.  Also,  the  explicit  form  permits  us  to  develop  an  algorthm 
for  const  rut  ing  the  time  scale  decomposition  of  (2.2)  that  makes  use  of 
Schur  complements  and  that  makes  clear  the  computations  required  to 
determine  the  time-scale  decomposition.  This  form  also  allows  us  to 
pose  and  answer  a  variety  of  questions  in  subsequent  chapters  and 
sections  of  this  chapter. 


2.4.2.  Explicit  Form 


As  mentioned  in  Section  2.3,  an  nxn  matrix  A(e)  over  T  has  its 
Smith  form 


A(e)  =  P(e)D{e)Q(e) 


where  P(e)  and  Q{e)  are  unimodular,  namely  | P (0)  |  ^  0,  I Q (0)  I  H  0  and 
D(e)  is 


D(e) 


— 

d^e) 

• 

• 

0 

dr(e) 

0 

0 

_ 

where  dj(e)=c1jf  j=l,....,r,  and  0<ij^ik,  j<k. 

consequently  we  can  rewrite  the  system  (2.2)  as 

x  =  P(e)D(e)Q(e)x  (2.11) 

Because  P(e)  is  unimodular,  P_1(e)  exists  in  the  neighbourhood  of  e=0. 
Later  on  we  shall  see  that  we  can  use  P(e)  as  a  similarity 
transformation  on  the  state  without  affecting  the  time  scales  of  the 
system.  In  particular,  multiplying  by  P_1(e)  on  the  left  of  both  sides 
of  Eq.  (2.11)  we  have 


P(e)  ^x  =  D(e)Q(c)P(e)P(e)  Xx 


Let 


Then 


y  =  P(e)“^x 


y  =  D(e)Q{e)P(e)y 


If  we  define 

S(c)  =  Q{e)P(e) 

we  have 

y  =  D(e)S(e)y 

=  diag{  I1#  el2,....,  en“1In_1}  R(e) 


(2.12) 


Eq.  (2.12)  is  called  the  explicit  form  because  t»(e)  explicitly  reveals 
the  time  scales  of  thes  system.  Recall  that  Q(e)  and  P(e)  are 
unimodular,  and  therfore  so  is  A(e). 

In  the  next  section  we  shall  consider  (2.12)  and  find,  under  a 
particular  set  of  conditions,  the  time  scale-decomposed  approximation  of 
y  (after  appropriate  similarity  transformations).  Before  we  do  that, 
let  us  overview  some  of  the  major  results  to  be  devoloped  in  the 
following  sections. 

2.4.3  An  Overview 


What  we  are  interested  in  doing  is  investigating  the  time  scale 


decomposition  of  (2.2),  or  equivalently  of  (2.12).  Coderch's  approach 
(Section  2.2.2)  provides  a  general  method  for  doing  this  involving  the 
confutation  of  the  total  projection  for  the  zero  group  of  eigenvalues  of 
A(e)  and  of  each  of  the  subsequent  system  matrices  A^(e),  A2(e),  .... 
defined  in  this  procedure.  All  of  these  confutations  involve  using  the 
complete  e-dependent  projection  matrices.  On  the  other  hand,  as  we  will 
discuss,  if  one  has  a  system  in  a  form  analogous  to  that  of  Kokotovic's 
treatment  (Section  2.2.1),  the  computation  of  these  projections  is 
straightforward  and  transparent,  as  is  the  check  of  the  MSSNS  and  MSST 
conditions.  What  our  approach  does  is  to  transform  the  system  so  that 
this  straightforward  construction  can  be  applied.  In  this  process,  we 
in  fact  throw  away  certain  parts  of  the  e-dependency  of  the  system 
matrices  that  are  unimportant  in  obtaining  a  time  scale  decompositions 
if  in  fact  the  original  system  has  a  time-scale  decomposition.  In  a 
sense  what  this  does  is  to  minimize  the  number  of  c-dependent 
computations  that  must  be  performed,  thereby  making  far  more  clear  what 
the  critical  e-dependencies  are  in  A(e),  its  total  projection  onto  the 
zero  group,  and  those  of  its  successors  (A-^(e),  A2(e),  etc.). 

To  be  more  specific,  let  us  assume,  both  here  and  in  the  next  three 
sections  that  A(e)  in  (2.2)  is  Hurwitz  for  e>0.  Now,  if  we  obtain  a 
time-scale  decomposition  for  y  in  (2.12),  it  is  straightforward  to 
obtain  a  time-scale  decomposition  for  x  in  (2.2)  using  an  e- independent 
similarity  transformation.  Specifically, 
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Lemma  2*1: 


B 


B 


B 

K 


T-  ;  7-.  ^  ■  , 


Suppose  that  (T;  Aq,  ...,  A^)  defines  a  time-scale  decomposition  of 
the  explicit  form  dynamics  (2.12).  Then  (TP“^(0);  Aq,  ....,  A^)  defines 
a  time-scale  decomposition  of  the  original  system  (2.2). 

Proof: 

It  is  straightforward  to  check  that  this  lemma  states  is  equivalent 
to  the  following.  Note  that 

x(t)  =  P(e)y(t) 

Define 

xx(t)  =  P(0)y(t)  (2.13) 

Then 

lim  sup  !  | x (t)  -  xWt)  1 1  =  0  (2.14) 

e->0  t20 

That  is,  (2.13)  is  an  asymtotically  accurate  approximation  of  the 
solution  of  (2.2). 

To  proceed,  note  that 

1 1  x  (t)  -  xx(t)  Hi  I  |P(e)  -  P(0)  1 1  |  |y(t)  1 1 
So  that  (2.14)  will  be  proved  if  we  can  show  that 

lim  sup  I  |y(t)  1 1  =  M  < 
e->0  t^O 


f 


However,  by  hypothesis 


y(t)  =  T"^-diag  {exp  [Agt] ,  ...  ,  expl^e*1^]  }Ty(0) 

where  each  of  the  is  Hurwitz*.  Since 

sap  |  lexpULeH]  1 1  =  sup  ||exp[Ait]||  <  00 

t>0  t>0 

the  result  is  proved. 

Note  that  one  consequence  of  the  lemma  is  that  (2.2)  has  a  time- 
scale  decomposition  if  and  only  if  (2.12)  does.  However,  while  this 
lemma  tells  us  a  little  bit  about  those  e-dependencies  that  can  be 
thrown  out,  there  is  far  more  that  can  be  said. 

Specifically,  recall  that  £(e)  in  (2.12)  is  unimodular. 
Intuitively,  what  this  means  is  that  K(e)  has  no  "structure  at  e=0". 
For  notational  simplicity,  let  us  denote  A(0)  as  A,  and  consider  the 
system 

z  =  D(e)Sz  (2.15a) 

Let  us  also  write  5  In  block  form  compatible  with  the  diagonal  block 
sizes  of  D(e) 


+  That  the  A^  are  Hurwitz  follows  from  the  assumption  that  A(e) 
is  Hurwitz  for  ef (0,e0] . 


As  we  saw  in  Section  2.2.1,  the  Schur  complement  play  an  important 
role  in  defining  the  time-scale  decomposition  of  the  system  considered 
by  Kokotov ic.  A  similar  statement  is  true  here.  Specifically,  in  the 
next  three  sections  we  define  a  sequence  of  matrices  Aq  9  ••••  9  Amanda 
similarity  transformation  T  obtained  by  successive  Schur  complementation 
of  ff,  and  we  provide  two  derivations,  one  based  on  the  approach  of 
Kokotovic,  and  a  second  making  explicit  contact  with  the  results  of 
Coderch,  of  the  following  (again  under  the  assumption  that  A(e)  is 


ahsacsm  hi-- 


Consider  the  original  system  (2.2)  and  the  explicit  form  system 
(2.12).  These  systems  have  well-behaved  time-scale  decompositions  if 
and  only  if  the  reduced  explicit  form  system  does.  Furthermore,  these 
well-behaved  time  scale  decompositions  exist  if  and  only  if  Aq,  ...  ,  A^ 
mentioned  above  all  exist  and  are  Hurwitz.  In  this  case  (T;  Aq,  ....  , 
Ajp)  defines  a  time-scale  decomposition  for  both  (2.12)  and  (2.15),  and 
(by  lemma  2.1),  (TP“^(0);  Aq,  ...  ,  Am)  defines  one  for  the  original 
system  (2.2). 

Let  us  make  several  important  comments  about  this  result. 
Specifically,  once  we  have  determined  P (0)  and  ff=Q(0)P(0),  all  of  the 
remaining  calculations  are  e- independent.  Thus,  we  have  identified  the 
critical  e-dependent  computations  as  the  determination  of  P(0)  and  0(0) 
in  the  Smith  decomposition  of  A(e).  Finally,  note  that  one  point  of  the 
Theorem  is  that  if  (2.15)  has  a  well-behaved  time  scale  decomposition, 
then 


lim  sup  | |y(t)  -  z(t) I |  =0 
e->0  t20 

However,  as  is  illustrated  in  Section  2.7,  this  need  not  be  true  if  a 
well-behaved  time  scale  decomposition  does  not  exist.  In  fact,  while 
A(e)  is  Hurwitz  for  e>0  by  assumption,  and  D(e)S(e)  =  P”^(e)A(e)P(e)  is 
obviously  Hurwitz  as  well,  D(e)A  need  not  be  Hurwitz  for  e>0.  Thus,  if 
the  original  system  has  a  well-behaved  time  scale  decomposition,  we  can 


throw  away  the  e-dependent  terms  in  A(e).  However,  if  such  a  time-scale 
decomposition  does  not  exist,  the  e-dependent  terms  of  R(e)  represent 
critical  components  of  the  damping  in  the  original  system  (2.2).  In 
Section  2.8  we  use  this  observation  to  define  time  scale  decompositions 
in  a  slightly  weaker  sense  by  pinpointing  and  keeping  these  e-dependent 
terms  in  £(e)  that  are  critical  to  system  stability.  This  construction, 
posed  as  an  open  problem  by  Coderch,  is  but  one  example  of  the  problems 
that  our  framework  allows  us  to  solve. 


2.5.  DERIVATION  FROM  TWO  TIME  SCALE  RESULTS 

2.5.1.  Introduction 

As  we  stated  in  the  last  section  (Section  2.4.3),  finding  the  time 
scale  decomposition  of  (2.2)  is  equivalent  to  finding  that  of  (2.12). 
In  this  section  we  shall  show  that  if  the  successive  Schur  complements 
AiA  are  Hurwitz  then  (T;  An,  ...  ,  A^)  defines  a  time  scale 
decomposion  for  both  (2.12)  and  (2.15),  and  (TP_1(0);  An,  ...,  a  ) 
defines  one  for  the  original  system  (2.2).  The  proof  is  in  fact  an 
extension  of  the  well-known  method  used  in  the  two  time  scale  case. 
Then  in  the  next  section  we  shall  relate  our  approach  to  the  multiple 
time  scale  results  obtained  by  Coderch  et3l,  and  this  will  allow  us  to 
prove  the  full  version  of  Theorem  2.1,  that  is,  that  the  Ai;i  being 
Hurwitz  is  necessary  as  well  as  sufficiant  for  the  system  (2.2)  to  have 
a  well-defined  time-scale  decomposition. 


2.5.2.  Derivtion  From  Two  Time  Scale  Results 


The  basic  idea  behind  this  approach  is  to  block-diagonal ize 
A1(e)=D(€)S(e).  It  uses  well  known  two  time-scale  results  repeatedly  to 
"peel  off"  each  time  scale.  So  first  let  us  review  some  of  those 
results.  Suppose  the  system  we  are  considering  has  the  following  form 


x  *  A(e)x 


where 


A(e)  = 


A11  a12 


cA21  ^22 


It  is  proven  [J7J  that  if  we  define 


T(e)  = 


”L(e) 


eH(c)  I-eH(e)L(e) 


(2.16) 


where  H  and  L  satisfy 

A12(e)-An(e)L-€L(A22(c)+A2i(e)L)  =  0  (2.17) 

H(A11(€)-LA2l(c))-e(A22(G)+A21(c)L)H+cA21(e)=0  (2.18) 


We  have 


T(e) 

A^(e)  A^fe) 

T_1(c)= 

G^e)  0 

* « * 
k  .1  M*s 

(2.19)  ’* 

eA21(e)  eA22(G) 

0  G2(e) 

r-y^ 

< 

r. 

where 

(e)  =A]j  (e)-eL  (e)  A21  (e) 

G2  (e)  =&22  (®)  +^21  ( ®)  ^  ( e)  (2.20) 


Solution  to  (2.17)  and  (2.18)  exist  as  long  as  a-q(O)  is  invertible  and 
e  is  sufficiently  small.  Note  that  under  these  conditions  T(e)  is 
unimodular  and,  as  e->0,  L(6)->-A;q”^(0)A;l2(0),  H(e)->0. 


Now,  consider  our  system  in  explicit  form  (Eq.  (2.12)) 


An(e) 

€^21(0) 


Alin(e) 

el2A2m^e^ 


21 

z2 


1 

Zm 


emAmi(e) 


Zm 


(2.21) 


Let  us  apply  this  procedure  m-1  times  in  succession  to  A^(e),  where 
at  each  stage  we  "peel  off"  one  of  the  time  scales  of  (2.21),  starting 
from  the  fastest.  We  begin  by  assuming  that  Ajj(O)  in  (2.21)  is 
invertible  and  apply  the  procedure  just  described  to  A^Jg),  with 
"^22  (g)  "  identified  as  the  large  lower  right-hand  block  matrix  in  (3.7) 
and  with  "A12(e)"  and  "eA21(e)n  defined  in  a  corresponding  manner.  Thus 


we  construct 


Tx(e)  = 


-L^c) 


e^H-^e)  i-e^H^ejLjJe) 


(2.22) 


where,  as  c — >0,  H(e) — >0,  G^(e) — >A^,  and 

L^(e)  =  [L^2(®)  ••••L]JI1(e) ) — >A-q  ^'[A^2....A2Jn] 


A22 (e) 


A2m<e> 


= 


eim-i2^m2 (e)  ....  eMI"l2Anin(e) 


Aij  (e)  «  Aj^j  (e) (e)  >Aij“AilAll  A^j  =  Aij 


where 


v  -  t  v  av  ■  xv  \v.wwr4' 


^  v 


~r\  ”ri~'.  ~v  ■  -« 


I 

i 

s 

i 

i 

i 

f 

K 

| 


We  now  see  that  if  A22  is  invertible  we  can  repeat  this  procedure  on 
G2(e).  Continuing,  we  finally  obtain 

Vi (e) . . . .Tx (e)  Ax (e) T2_1  (e) . .  .T^"1  (e) 

=  diag{G1(e),  ei2G2(e) e111^^) } 

provided  that  An  =  G^O),  A22  =  G2(0),....,Ain_1,in_1  =  Gm-1(0) 
are  all  invertible.  Define 

A(e)  =  diag{Au,  e12^,....^1™^) 

We  shall  prove 

Ibegcgp  2*2: 

If  are  all  Hurwitz  then  (T;  Allf  ...f  Aj^)  defines  a 

time-scale  decomposition  for  both  (2.12)  and  (2.15)  and  (TP”^(0); 
An,...,  W  defines  one  for  the  original  system  (2.2). 

To  prove  this  theorem  we  need  the  following  lemma. 

Lama  2.2 

Let  E(e)  be  an  nxn  matrix  with  entries  from  T.  If  E(0)  is  Hurwitz, 

then 


lim  sup  ||exp{E(e)t}  -  exp{E(0)t}||  =  0 
e->0  t20 
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It  is  well  known  [9]  that 

exp{E(e)t}  =  exp{st}R(s,E(e))ds 

r 

where  p  is  a  positively-oriented  contour  enclosing  all  eigenvalues  of 
E(e)  and 


R(s,E(e))  =  (sI-E(e))-1 

Note  that  for  e  small  enough  E(e)  is  Hurwitz  and  r  can  be  chosen  to  lie 
in  the  left-half  plane  and  to  enclose  all  eigenvalues  of  E(0)  as  well  as 
E(e).  Therefore  we  have 

I  |exp{E(e)t}  -  exp{E(0)t}  I  I  * 

1/2TJI  |  j  exp{st}  [R(s,E(e))  -  R(s,E(0))]ds|  I 
S.  l/2TTexp{-at}  I  |^R(s,E(e))  -  R(s,E(0) ) I  Ids 

r 

where  a  is  a  positive  number.  The  uniform  convergence  of  R(s,E(e))  to 
R(sfE(0))  on  p  [9]  then  proves  the  lenma. 

Proof  Of  Theorem: 

As  a  first  step  we  note  that  by  lemma  and  the  assumption  that 
An,...fAmm  are  Hurwitz 

lim  sup  I  |diag[exp{G1(e)tr....fexp{e1IT,Gin(e)t]  - 
e->0  t>0 


diag[exp{A11t},...,exp{e^Anmt}]  1 1 


m 

£  lim  Z.  sup  | |exp{Gr{e)t}  -  exp{Arrt} I |  =  0 
e->0  r=l  t20 

Thus,  since  T(e)  is  unimodular,  we  have 

lim  sup  |  |exp{Ai(e) t}  -  ^-^(eJexpCACeJtjTCe)  1 1  =  0 
€>->0  t20 

From  the  proof  we  can  see  that  this  result  holds  not  only  for  (2.12), 
but  also  for  (2.15).  Thus  we  have  proved  the  first  part  of  Theorem  2.2. 
To  prove  that  (TP-1(0);  A-q,...,  defines  a  time-scale  decomposition 

for  the  original  system  (2.2)  we  need  just  invoke  Lemma  2.1. 

As  a  final  oonment  we  note  that  the  recursive  procedure  for  peeling 
off  successively  slower  time  scales  actually  yields  a  sequence  of 
approximations  over  longer  time  intervals  if  exp{A(e)t}  is  replaced  by 
diagfexpLA-jjt] ,...., exp [Arreirt] ,  I,...., I},  we  obtain  a  uniform 
approximation  over  an  interval  of  the  form  [0,  e1”ir+^J  (see  [7]  for  a 
similar  corrment) . 

The  results  in  this  section  show  that  the  Hurwitz  condition  is  a 
sufficient  condition  for  the  system  (2.2),  (2.12)  and  (2.15)  to  have 
well-behaved  time-scale  and  provide  the  actural  time-scale 
decomposition.  As  mentioned  previously,  in  the  next  section  we  show, 
based  on  Coderch's  results,  that  this  condition  is  also  neccessary. 
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2.6.2  The  Schur  Complement 


Consider  the  matrix 


B  = 


B11  b12  ••••Bln 


Bnl  Bk2  • *  *  *Bkk 


where  the  Biif  i=l,...,n  are  square  matrices.  Define 


Yk  = 


B11  b12  ****Blk 


Bkl  Bk2  •***Bkk  J 


k<n 


Then  the  Schur  complement  of  Yk  with  respect  to  is 


Bkk  =  Bkk~[Bkl - Bk.k-1^ <Yk-l>”1[Blk'‘'*Bk-l,k 


If  we  define 

Sj^j  *  (Yk_i)  i=l/..../k— 1 

The  Schur  coplement  can  be  written  as 

Bkk  “  Bkk”lBkl***'Bk,k-ll  Iskl' - sk,k-l'J' 

k-1 

=  Bkk"^  Bkiski 
i=l 


(2.26) 


(2.27) 


A  number  of  properties  of  Schur  complements  are  described  in 


Appendix  1,  which  we  shall  find  useful  m  proof  of  our  main  results. 
In  addition,  there  is  one  other  relatively  simple  property  of  Schur 
complements  that  we  are  in  essence  using  in  this  section  but  whose 
derivation  is  deferred  until  Section  2.7  where  we  derive  other  related 
results.  Specifically,  in  Section  2.5  we  define  the  Aj^  as  successive 
Schur  complements.  That  is  A]j  is  simply  the  upper  left-hand  block  of 
A.  Then  we  perform  a  Schur  complement  of  A  compatible  with  the  block 
structure  shown  below 


A11  a12 
A21  a22 


Aml  Am2 


A11  c12 
C21  C22 


So  that 


^22  =  c22  "  C21All~lc12 


and  A22  is  then  simply  the  upper  left-hand  block  of  022*  The  procedure 
then  continues  step  by  step.  On  the  other  hand,  the  Schur  complement 
Akk  defined  as  in  (2.  27)  if  the  B's  are  replaced  by  A's,  is  obtained  in 
one  step.  That  these  two  computations  yield  the  same  result  is  shown  in 
Section  2.7.2. 


2.6.3.  Main  results 


The  main  result  in  this  section  is  the  proof  of  Theorem  2.1,  based 
on  Coderch's  results.  The  basic  idea  behind  the  proof  is  to  construct 
the  succesive  projection  matrices  and  pseudo- inver sees  used  in  Coderch's 
approach.  By  using  the  explicit  form  for  these  computations  we  can  then 
directly  relate  the  MSSNS  and  MSST  conditions  to  the  condition  that  all 
of  the  Aj^  are  Hurwitz.  Since  the  p.oof  is  tedious  we  leave  it  in 
Appendix  A2  at  the  end  of  this  chapter.  One  byproduct  of  this 
computation  is  the  explicit  identification  of  the  similarity 
transformation  T  needed  in  the  time  scale  decomposition.  Thus  what  we 
actually  obtain  is  the  following: 

Theorem  2.3: 

Consider  system  (2.2)  and  its  explicit  form  (2.12)  and  reduced 
expl  icit  form  (2.15).  Let  the  system  matrices  of  (2.15)  be 

A11  A12  ••••  Aln  1 


Anl  *n2  *nn 


and  define 


where  the  Aiif  i=l,....,n  are  the  Schur  complements  of  A  as  defined 
before.  Then  we  have 

1.  Systems  (2.2),  (2.12)  and  (2.15)  have  well-behaved  time- 

scale  decompositions  if  and  only  if  the  i=l,...,n 

are  all  Hurwitz. 

2.  If  the  A^  are  all  Hurwitz,  then  (T;  A-q,  ...  ,  A^) 
defines  a  time-scale  decomposition  for  both  (2.12)  and 
(2.15),  and  (TP_1(0);  An,  ...,  A^)  defines  one  for 
the  original  system  (2.2),  where 


and  the  Si;j,  i=2,  ...,  n,  j=l,  ...,  n-1  are  defined  in  Eq.(2.26)  if  we 
replace  with  Aj^.  A  direct  consequence  of  this  result  is  the 

following. 

Corollary: 

The  following  conditions  are  equivalent: 

1.  A(e)  and  its  explicit  form  satisfy  the  MSSNS  (MSST) 
condition. 

2.  hi  i  are  of  full  rank  (Hurwitz). 


2.7  Relationship  Between  The  Two  Derivations  And  The  Uniqueness 
Of  The  Time  Scale  Decomposition 

2.7.1.  Introduction 

In  this  section  we  show  first  that  the  Aj^  in  section  2.5  are 
nothing  more  than  the  Schur  complements  in  Section  2.6.  We  then  turn 
our  attention  to  the  question  of  the  uniqueness  of  the  time  scale 
decomposition.  Specifically,  we  show  that  although  the  matrices  P(e) 
and  Q(e)  are  not  unique  (because  of  the  non-uniqueness  of  Smith  form 
decomposition)  the  Schur  complements  of  A=Q(0)P(0)  corresponding  to 
different  choices  of  P(e)  and  Q(e)  are  similar.  Therefore  different 
Smith  decompositions  give  the  same  time  scale  result  up  to  a  similarity 
transformation  at  each  time  scale. 

2.7.2.  Recursive  and  nonrecursive  computation  of  Schur 

complements 

To  prove  that  A^  as  defined  according  to  Eq.(2.27)  equals  Aj^  in 
Eq.{2.23)  we  need  the  following  lemma. 


Lenina  2.3: 

Consider  an  invertible  matrix  A 


An  Ai2  a13 
A  =  A2i  A22  A23 
A31  A32  A33  _ 

Define 

Fij=Aij"AiiAif1A1j,  if j/=2f3 

Then  the  Schur  complement  of  A33  is 

A33=F33“F32f22~1f23 


(2.2 


(2.2 


(2.32) 


Proof:  Consider  the  linear  equation 

A[x'  y'  z']'  =  [0  0  b']'  (2.: 

It  is  easy  to  show  that  the  solution  is  z=A33~*b.  We  can  also 
write  the  solution  as 


All”1 tA!2 


57 


fF22  f23 


N 


a 


i  *. 

I 


p 


F22 

f23 

y 

0 

F32 

F33 

z 

b 

Solving  (2.34)  for  z  we  have 

z= [F33-F3222~1f23 1 ~lb 


(2.34) 


(2.35) 


Eq.(2.35)  is  true  for  all  possible  b.  Also  the  solution  for  Eq.(2.33) 
is  unique.  Therefore  we  have  proved  the  lemma. 


Note  that  f33“F32f22~1f23  is  nofching  nore  than  the  A33  defined  in 
Section  2.5.2.  Hence  we  can  conclude  that  A33  defined  in  Section  2.6  is 
the  same  as  that  defined  in  (2.26).  Repeatedly  using  Lemma  2.2  it  is 
easy  to  prove  that  this  is  true  for  all  k=l,...,n.  * 


2.7.3.  Similarity  of  Schur  complements  of  different 
decompositions 

In  this  subsection  we  show  that  although  for  different  PDQ 
decomposition  the  matrices  Q(0)P(0)  are  different,  their  Schur 
complements  are  similar.  We  first  show  how  the  Schur  complement  of  a 
matrix  is  influenced  by  a  left  (right)  multiplication  by  a  lower 
(upper) triangular  matrix.  Then  we  prove  the  similarity  between  Schur 
complements  of  different  decompositions. 


Lenina  2.4 :  Consider  a  matrix 


where  A  is  invertible.  Consider  the  Schur  complement 
D=D-CA-1B 


and  suppose  that  D  is  invertible.  Let 


where  and  are  invertible.  Let 
61  =  DI-CiA1“1B1 


Then 

Dx  =  V]D 

Let 


‘A 

b"i  r  t2  v2- 

_  a2 

Sj  = 

[c 

dJ  Lo  v2. 

u 

°2- 

where  T2  and  V2  are 

invertible. 

Let 

D2  =  D2“C2a2  1b2 

Consider  the  linear  equation 


r*  Bi 

X 

"O' 

— i 
Q 

U 

- 1 

_b_ 

.a. 

The  solution  is 


a 


C 


By  left  multiplying  by 


on  both  sides  of  (2.36)  we 


have 


r  " 

- 

A1  B1 

X 

0 

Cl  Dj. 

L  J 

y 

Via 

L  A  J 

Hie  solution  of  (2.38)  is 


y 


*  Dj.  V 


c 


Combining  (2.37)  amd  (2.39)  we  have 

D"1  =  D1~1V1a  C 

Because  (2.40)  holds  for  all  a€I^,  we  have 
D"1  *  D}'1^ 


or 

^  =  V^D 


Now  note  that  (2.36)  can  be  rewritten  as 


or 


A  B 

C  D. 


t2  u2 


0  v. 


2-1 


r  T2“1x-T2"1u2v2~1y 


v2'1y 


h:i 


Let 


*1  =  T2-xl-T2'lu2v2-1y 


We  have 


Then 


G  XJ-Q 


v2_1y  =  D2_1a  =  v2_1D“1c 


Therefore 


d2  =  w2 

Suppose  A(e)  has  two  deoonpositions: 


A(e)  =  P^eJDteJQ^e)  =  P2(e)D(e)Q2(e) 

D(e)  =  diag{lQr  elp  =  diag{d^  9  ••••  }  (2. 


Where  dA  ■  e3*,  O^a^a^ - ia^  Then  we  have 


Lemma  ZJr. 

The  Schur  conplement  of  Q^(0)P^(0)  and  Q2(0)P2(0)  are 
similar. 


Proof : 

For  simplicity,  let  us  write  D(e),  P^U)  and  Q^(e)  as  D,  P^ 
repectively  for  i=l,2.  From  (2.41)  we  have 

Wi’VMi"1 

M  N~ 

Therefore 


Or 


M_1D=DN 


(2.42) 


Suppose 

d^— d2~ ...— dgo— If  dso+lc*»*=dsl“e»*M» 
dsm-l+1=**-=dns€mf  Sin=n 

Define 


RQ-{l,...fs0},  Ri-{s0+lr  ...rSi),...,  I\n*lsm_1+l,...,sfo} 
If  i,j  Rj^f  k=0,...f  from  (2.42)  we  have 

(MT1(e))ij=(N(e))  ij 


and  Q 


or 


(M-1(0)),,»(N(0))4^ 


If  i *  Rfcp  ^  £ ^k2f  k1^2  t^ien^  because  (M“ -L{0)) ij  and  (N(0))^j  are  finite 
for  all  i, j,  we  have 


(N(0) ) ij=Nij-Jf  kx<k2 
(ri-1^))^,  kx>k2 

Then  N (0)  and  M“^(0)  can  be  expressed  as 


N(0)  = 


n 


11 

n21  n22 


■nm+l/nH-l  J 


(2.43) 


* 

nm+l,m+l 


Hence 


M(0) 


* 


rWl,nrt-l 


(2.44) 


By  definition  of  M  and  N  we  have 


NS^M  *  q2p2 


Therefore  in  view  of  the  special  forms  of  (2.43)  and  (2.44),  the  k-th 


principal  sub-block  of  0^(0)  P^(0)  and  Q^OJP^O)  denoted  by  [Qi^ilk  311,3 
^p.2^k  0311  be  re3-3te<3  by 


Pll  A  1 

rnn_1  1 

.  0 

* 

• 

(Qlpllk 

* 

• 

o  , 

nkk_ 

L  nkk  J 

=  [Q2p2]k 


(2.45) 


Then  applying  Fact  1  to  (2.45)  we  see  that  the  k-th  Schur  complement  of 
[Q2p2^k  =  Hk2  is  reiafced  to  the  k^  Schur  complement  of  by 

Hk2  =  nkkHklnkk-1 

Thus  we  have  shown  that  for  different  decompositions  P-^DQi  and  P2DQ2, 
the  Schur  complements  are  related  by 

A2(c)  =  UA^(e)U_1 

where 


U  =  diag{nllf . . . .  ,nnm} 

and  n^,  i=l,....,m  are  the  diagonal  blocks  of  the  matrix 
N(0)  =  Q2(0)Q1(0)"1  =  diagin^....,!^} 


From  this  lemma  we  see  that  although  the  Q(0)P(0)  is  not  unique, 
its  successive  Schur  complements  are  similar.  Furthermore,  the  time- 


2.8  Summary  Plus  An  Extension 


2.8.1  Introduction 

In  this  chapter  we  have  developed  a  new  approach  for  determining  if 
a  system  of  the  form 

x  =  A(e)x 

has  well-defined  time  scale  behavior  and  for  constructing  the 
corresponding  multiple  time  scale  approximation.  As  we  discussed  in 
Section  2.1,  a  major  motivation  for  this  work  was  a  desire  to  gain 
additional  insight  into  the  algebraic  structure  of  systems  with  several 
time  scales  and  to  perhaps  develop  an  approach  that  is  conceptually  (and 
hopefully  algorithmically)  simpler  so  that  further  study  of  these 
systems  might  be  facilitated.  We  feel  that  the  approach  described  in 
this  chapter  accomplishes  this,  since  (1)  we  have  been  able  to  provide  a 
clear  bridge  between  the  general  work  of  Coderch,  et  al.  [7]  and  the  far 
simpler  and  more  transparent  results  developed  byKokotovic,  et.  al.  [17] 
for  systems  in  what  we  have  termed  "explicit  form",  and  (2)  the  approach 
established  in  this  chapter  provides  the  foundation  for  posing  and 
answering  numerous  important  questions  about  systems  with  several  time 
scales.  In  this  section  we  first  briefly  review  the  main  results  of 


this  chapter  to  support  the  first  of  these  points  and  we  then  present 
two  additional  results  which  support  the  second  point.  The  remaining 
chapters  of  this  thesis  provide  further  indication  of  the  usefulness  of 
this  approach  to  time  scale  analysis. 

Let  us  first  review  the  major  result  of  this  chapter. 
Specifically,  we  have  analyzed  the  multiple  time  scale  structure  of  a 
system  of  the  form 

x  =  A(e)x  (2.45) 

where  A(e)  is  assumed  to  be  Hurwitz  for  e  f  [0,Cq],  Our  approach  consists 
first  of  performing  a  Smith  decomposition  of  A(e) 

A(e)  =  P(c)D(e)Q(c)  (2.46) 

where  |P(0) IHO,  |Q (0) IHO  and 

D(e)  =  diag  (ek0I0,  €klI1,....,ekn_1In_1) 

kj  then  consider  a  similarity  transformation  on  (2.45)  which  leads  us  to 
the  explicit  form 

y  =  D(e)S(e)y 

where  A(e)=Q(e)P(e)  and  the  reduced  explicit 


z  =  D(e)Sz 


(2.47) 


* 

l*: 


r;v  ■  v'  ;"■  ;  -■;»  v—V  *”■'  ’V’V 


r  vtv»vt  \rv  v"  v\ 


."•'V^-'\^  ,-■  ■--  ,-v  '  y.'yvvv'Yl 


’  An 

A12 

....  A^ 

=  S(0)  = 

A21 

h.2 

....  A2n 

LAnl 

^v2 

1 

j 


(here  the  block  dimensions  are  compatible  with  those  for  D(c)  in 
(2.47)).  We  have  shown  that  A(e)  satisfies  the  MSST  condition  i£  and 
only  if  each  of  the  successive  Schur  complements  Aj*  (see  Section  2.6 J2 
for  definition)  of  A  is  Hurwitz.  In  this  case  we  have  then  shown  how  to 
construct  a  similarity  transformation  T  so  that 

lim  sup  1 1  exp{A(e)t}  -  TexpfAfeJtJT-1  I  I  =  0  (2.48) 
e->0  t20 


where 

A(e)  =  diag  l  g^A-q  ,  eklA22  , . . . , 

That  is,  (A-Q,....,Ann;T)  define  a  time-scale  decomposition  of  the 
original  system. 

Let  us  make  two  comments  about  this  result.  The  first  is  that  it 
is  straightforward  to  extend  this  result  to  the  case  in  which  we  assume 
only  that  A(e)  is  semistable  for  e>0,  i.e.  that 

lim  exp{A(e)t} 
t->«* 


exists  (so  that  A(e)  has  no  eigenvalues  in  the  right-half  plane  and  the 
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m!,r?r*y\,y 


only  eigenvalues  on  the  jw  axis  are  simple  eigenvalues  at  0).  Under 
this  condition,  the  Smith  decomposition  of  A(e)  is  as  in  (2.46),  but 
D(e)  has  the  form 

D(e)  =  diag(ek0I0,....,G,cn-1In»1,0)  =  diaglD^e)  ,0] 


In  this  case  the  explicit  form  of  A(e)  is 


D(e)A(e)  = 


\(e)  O' 

"  A1(e) 

A2(ef 

.0  0. 

.  0 

0 

Now  A-j^e)  is  invertible  since  the  upper  block  corresponds  to  the  nonzero 
eigenvalues  of  A(e).  Furthermore,  as  direct  consequence  of  the  result 
in  this  chapter,  A(e)  will  have  MSSNS  only  if  A^  (0)  is  invertible,  i.e. 
only  if  A^e)  is  unimodular+.  Therefore,  let  us  assume  this  is  the  case 
and  define  the  unimodular  matrix 


R(e) 


I  -A1_1(e)A2(e) 
0  I 


Then 


»  ■  I  'M  *  * 


+  By  the  assumption  that  lim  expA(c)t  exists, 

t~  >®° 


Hurwitz. 


then  Aj(0)  is  in  fact 


r 
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L^nl  2  ^nn -I 

(here  the  block  dimensions  are  compatible  with  those  for  D(c)  in 
(2.47)).  We  have  shown  that  A(e)  satisfies  the  MSST  condition  if  and 
only  if  each  of  the  successive  Schur  complements  Aii  (see  Section  2.6.2 
for  definition)  of  A  is  "urwitz.  In  this  case  we  have  then  shown  how  to 
construct  a  similarity  transformation  T  so  that 

lim  sup  1 1  exp{A(e)t}  -  Texp{A(e)t}1<-1  I  I  =  0  (2.48) 

e->0  t20 


where 

A(e)  =  diag[ek0A11/eklA22'»*»'ekn_1*nn^ 

That  is,  (A^lf....,Ann;T)  define  a  time-scale  decomposition  of  the 
original  system. 

Let  us  make  two  comments  about  this  result.  The  first  is  that  it 
is  straightforward  to  extend  this  result  to  the  case  in  which  we  assume 
only  that  A(e)  is  semistable  for  e>0,  i.e.  that 

lim  exp{A(e)t} 
t->«* 


exists  (so  that  A(e)  has  no  eigenvalues  in  the  right-half  plane  and  the 


/.v.'.v.v.'  v  '.i  ■  v  v  'i  ■  ■.  ■  v  -<v  <  v'  ’ . 


R-^eJDtejSteJRfe)  = 


D^ejA^e) 

0 


0 

0 


Now  apply  the  time  scale  Decomposition  developed  in  this  chapter  to  the 
explicit  form  matrix  D^ejA^e).  If  this  does  not  have  well-behaved 

time  scale  structure,  then  neither  does  the  original  system.  If  it  does 
have  well-behaved  time  scale  structure,  let  (A11,....,Ann;T)  denote  the 
time  scale  decomposition  of  this  system.  Then  let 


A(e)  =  diagte^Ajj,....,  0} 

T-^  =  diag[T  0] 

T  *  P(0)R(0)T1 

A 

It  is  then  straightforward  to  check  that  (A(c),  T)  define  the  time  scale 
decomposition  of  A(e). 

The  second  point  we  wish  to  make  is  that  if  A(e)  does  not  satisfy 
the  MSST  condition  (i.e.  if  not  all  of  the  A^  are  Hurwitz),  then  (1) 
the  reduced  explicit  form  (2.15)  need  not  be  asymptotically  equivalent 
to  the  explicit  form  (2.12)  or  the  original  system  (2.2);  and  (2)  the 
origenal  system  does  not  have  well-defined  time  scale  behavior  in  that 
sense  that  we  have  used  so  far  —  i.e.  it  is  not  possible  to  satisfy 
(2.10}  for  any  choice  of  constant  matrices  A^  (i.e.  not  restricting 
attention  to  simply  the  definitions  of  these  matrices  as  successive 
Schur  complements).  Coder ch  [7]  conjectured  that  it  might  be  possible 
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Let  us  show  that  this  system  does  not  have  well-defined  time  scale 
behavior  in  the  sense  we  have  been  using  so  far.  Substituting  r=et  we 
have 


xjfc)  =  x1Qe-c~€cs inc/e 
x2(r)  =  X20e“c“eccost/e 


(2.52) 


Obviously  x(t)  does  not  have  a  limit  as  e->0  because  of  the  rapid 
oscillations.  On  the  other  hand,  in  order  to  have  a  well-defined  time 
scale  approximation  we  require  that  there  is  some  Aq  so  that 

lim  sup  1 1  exp{A(e)t}  -  Texp{AQt}'F“1  1 1  =  0 
e->0  t20 


If  this  were  true,  then  the  following  should  also  be  true: 

lim  sup  1 1  exp{A(e)t/c}  -  TexpiAgt/eJT'1  1 1  =  0  (2.53) 

e->0  c20 

It  has  been  shown  in  (2.51)  that  the  first  term  does  not  have  a  limit  as 
e  goes  to  zero.  So  the  only  way  to  have  (2.53)  satisfied  is  if  the  left 
hand  side  is  identically  0,  but  using  a  constant  Aq  does  not  allow  us  to 
do  this. 


A  further  investigation  shows  that  if  we  slightly  change  our 
definition  of  well-defined  time  scale  behavior,  then  we  may  be  able  to 
extend  our  results  to  a  larger  class  of  systems.  For  example,  if  we  use 


to  find  an  approximation  in  the  sense  of  (2.10)  even  if  A(e)  violates 
the  MSST  condition  if  one  allows  the  Aj^  to  be  e-dependent.  In  the  next 
two  subsections  we  show  how  to  construct  such  an  approximation  which 
keeps  only  those  e-dependent  terms  that  are  needed  to  achieve  (2J.0). 

Finally,  in  Section  2.8.4  we  discuss  the  problem  of  using  feedback 
to  modify  the  time  scales  of  a  perturbed  system.  Specifically,  up  to 
this  point  we  have  considered  the  multiple  time  scale  behavior  of  the 
undriven  system  (2.2).  In  2.8.4  we  discuss  the  driven  system 

x  =  A(e)x  +  B(e)u  (2.49) 

with  the  state  feedback 

u  —  k(e)x  (2.50) 

Hie  natural  question  here  is  to  ask  what  freedom  there  is  in  assigning 
the  time  scale  structure  by  application  of  the  state  feedback  of  (2.50). 
If  we  insert  (2.50)  into  (2.49),  we  have 

x  =  [A(e)  +  B(e)K(e)]x  *  F(e)x  (2.51) 


with 

F(e)  =  A(e)  +  B(e)K(e) 

As  we  have  developed  in  this  chapter,  the  time  scales  of  a  system  like 
(2.51)  are  determined  by  the  invariant  factors  of  F(e).  Thus  assigning 
the  time  scales  of  (2.49)  by  state  feedback  (2.50)  naturally  leads  to 
the  problem  of  invariant  factOI  assignment  of  F(e)  by  means  of  choosing 


constant  invertible  matrix  T  such  that 

11m  sip  1 1  exp{A(e)t}  -  Texp{A(e)t}T"l  1 1  =  C 
e->0  t>0 

where 

A(c)  =  diag[A0(e),eA1{e),...fen_1An_1(e)] 

Let  us  now  study  the  relationship  between  MSSNS  condition  and  the 
eigenvalues  of  A(e)  because  a  deeper  understanding  will  help  us  capture 
the  essence  of  our  approach.  Recall,  A(e)  being  Hurwitz  for  Gq>€>0  is 
an  implicit  assumption  throughout  this  Chapter  (except  at  the  end  of  the 
preceding  subsection).  We  claim  that  if  A(e)  satisfies  the  MSSNS 
condition,  Aj^  cannot  have  eigenvalues  with  positive  real  parts.  The 
reason  is  the  following.  As  we  shall  see  in  Chapter  3,  Theorem  3.2,  the 
eigenvalues  of  A(e)  are  clustered  in  n  groups,  with  those  in  the  k-th 
group  lying  within  0(e^+l)  of  the  eigenvalues  of  ek^Ajj.  Consequently, 
if  Ajj  for  some  j  has  an  eigenvalue  with  positive  real  part,  then  one  of 
the  eigenvalue  of  A(e)  must  have  a  positive  real  part  for  e  sufficiently 
close  to  zero.  This  contridicts  the  assumption  that  A(e)  is  Hurwitz. 
What  can  happen,  however,  is  that  for  some  A^  the  real  parts  of  some  of 
their  eigenvalues  are  zero.  If  A(e)  has  MSSNS  (so  that  the  Aj^  are 
invertible),  we  must  then  have  that  these  eigenvalues  are  purely 
imaginary  (i.e.  not  equal  to  zero).  The  implication  of  this  for  A(e)  is 
that  its  corresponding  eigenvalues  have  negative  real  parts  of  orders 
that  are  higher  than  those  of  their  imaginary  parts.  By  keeping  only 
the  dominant  terms,  then,  we  in  essence  throw  away  these  damping 


effects.  If  we  can  somehow  keep  the  dominant  terms  of  these  real  parts, 
we  should  still  have  a  good  approximation  as  we  have  seen  in  Example 


2.8.1. 


In  the  next  subsection  we  introduce  an  iterative  approach  that 
guarantees  the  retention  of  the  dominant  e-dependent  parts  of  each  mode. 
This  approach  is  based  on  the  following  observations.  Consider  the  two 
time  scale  case. 


A(e)  = 


A 


B 


jeC  eDj 

where  A  is  invertible.  We  have  shown  that  there  are  a 
T(e)  and  two  matrices  Gj  (e) ,  G2(e)  such  that 


(2.54) 

unimodular  matrix 


A(e)  =  T(e) 


Gjte)  0 
0  ^2(e) 


Tr'1(e) 


To  really  compute  G^ (e)  and  G2(e)  is  impractical  .  In  the  previous 
sections  we  used  A  and  D=D-CA“^B  to  approximate  G^(e),  G2(e).  But,  as 
we  just  pointed  out,  if  the  MSST  condition  does  not  hold  then  this 
approximation  fails,  and  we  have  to  retain  some  e-dependent  terms.  We 
also  know  that  G1(e)  and  G2(e)  can  be  obtained  by  an  iterative  procedure 
if  A(e)  satisfies  the  MSSNS  condition  [17].  As  we  show  in  the  next 
subsection,  if  we  somehow  know  the  orders  of  the  real  parts  of  the 
eigenvalues  of  A(e),  then  after  a  specific  finite  number  of  iterations 
the  results  capture  the  dominant  parts  and  we  can  stop.  In  the  next 


subsection,  we  first  describe  the  iterative  procedure  proposed  in  [17]. 
Then  we  prove  some  lemmas  which  are  used  to  determine  the  stopping 
rules.  Finally  we  describe  our  approach,  which  includes  a  method  for 
determining  an  upper  bound  on  the  orders  of  the  real  parts  of  the 
eigenvalues. 


2.8.3  An  Iterative  Algorithm  for  Extended  Time  Scale 
Decomposition 

As  mentioned  in  the  previous  subsection,  we  have  shown  in  Chapter 
2,  Section  2.5  that  we  can  write  A(e)  in  (2.54)  in  the  form 


A(e)  =  T(e) 


Gj(e)  0 
0 


T"^(e) 


(2.55) 


There  is  an  iterative  approach  to  determine  G^(e)  and  (^(e)  which 
is  presented  in  the  foil  wing  Lemma. 


Let  us  first  define  matrices  and  associated  with  a  matrix  F 
where 


F11  f12 
f21  f22 


(here  F  may  be  e-dependent  but  we  suppress  this  dependency) 


Specifically 


I  fu-*f12 


-F2iFn'1  1 


(2.56) 


if  F-q  is  invertible.  Then  we  have 


Lenina  2.8.1 


E0(0)(e)  =  A(e)  = 


A(e)  B(e)l  L<0>  B0(°) 


eC(e)  eD(c)J  eCg(°>  cDg(°) 


(2.57) 


where  Ag^,  Bg^,  Cg^  and  Dg^  are  generally  functions  of  e  and 
Ag(°)  is  invertible  at  e=0.  Define 

E^te) 

=  P2 (3) ...P2 <i)p1  (i)  ...P1<1)Eg^°)  (e)  (P ■1<1))“1... 

(Px  ( i) ) _1  (p2  (i) ) _1 . . .  (P2  (^ ) ) -1 


e3+1Ci(3)  eD.(j) 


(2.58) 


Then  as  i — >°* ,  j — >» 


A^3)  — >  Dj^^)  — >  g2 


where  and  G2  are  defined  in  Section  2.5.2  (see  (2.19))  and  p^)  and 


are 


P2^  are  the  matrices  associated  with  Ej_^®)(e) 
respectively,  as  in  (2.56).  Again  all  terms  in  (2.58) 
0>2  are  functions  of  e. 


and  Ej^j-1)(€), 
including  and 


Proof  i_  See  [17] . 


tote;  Prom  (2.56)-(2.58)  we  have 

Ai(j+1)  =  A^j)  +  (A^^ <3> )_1  (2.59) 

Ai+l(j)  ■  Ai(^)  +  (2.60) 

D^+D  >  (2.61) 

Thus,  after  the  (i+j+l)-th  iteration 

0(Ai<3+1)  -  q(3))  2  i+j+1  (2.62) 

°(Ai+l(j)  "  Ai(j))  ^  i+3+l  (2.63) 

=  0(Di+1(3>  -  q^)  2  i+j  (2.64) 


Therefore  we  would  expect  that  after  a  finite  number  of  iterations 
the  A^( 3)  and  D^(^  would  be  "close  enough"  to  and  G2  so  that  we  can 
use  them  to  approximate  G^  and  G2.  The  next  two  lemmas  give  the 
definition  of  "closeness"  and  provide  conditions  based  on  which  we  can 
determine  the  number  of  iterations  required.  The  first  lemma  states  the 
eigenvalue  order  relations  for  two  matrices  A1(e)  and  A2(e)  under  which 
expfAjfeJt}  can  be  approximated  by  exp{A2(e)t}.  Let  i=l,2. 


j=l,...,N  be  eigenvalues  of  A^(e)  and  A2(c),  repectively,  and  let  0(x) 
denote  the  order  of  x,  then  {  suppressing  the  e-dependence  of  A  ^  ) 

Lenina  2.8.2  If  A^(e)  and  A2(e)  satisfy 

1.  A^O)  =  A2(0)  has  SSNS. 

2.  0(Re(A1:j))  =  0(Re(A2j))  =  rj  <  OUteW^-i^j))#  j=l,...,N. 

3.  Odm^^))  =  0(Im(A2j))  -  Sj,  and  if  rj2Sj  then 

then 

lim  sup  ||  exp{A-,(e)t}  -  exp{A2(e)t}  1 1  =  0 
e->0  t>0 

Pr«?£: 

Define  R[A(g),a]  =  (M-A(e))“^  as  before.  From  Condition  1  we  know 
that  the  nuirber  of  the  eigenvalues  whose  real  parts  have  orders  higher 
than  0  is  the  same  for  both  A^c)  and  A2(e).  Denote  this  number  by  m 
and  suppose  that  the  first  m  eigenvalues  are  these  higher-order 
eigenvalues.  From  the  assumption  that  A(e)  is  Hurwitz  for  0<g<Gq,  it  is 
also  clear  that  the  leading  term  (i.e.  the  lowest  order  term)  of  the 
real  part  of  each  eigenvalue  has  a  negative  coefficient.  Now,  write 
R[Ai(e),A]  as 

m  F^fc)  E^e) 

RfAi  (c) , a]  =  T  - i -  +  — - -  ,  i=l,2  (2.65) 


where  p^e)  =  (A"Ai,m+2)  — (A_xi,N)  and  Fij  (e)  and  E^(e)  are 

functions  of  «  only  and  are  analytic  at  e=0.  We  know  that 


exptA-jJeJt}  -  exp{A2(e)t} 

=  $  {R[A1(e}fA]  -  R[A2(e),A}}eAtdA  (2.66) 

where  C  is  the  contour  enclosing  every  eigenvalues  in  the  complex  plane. 
Using  (2.65),  (2.66)  can  be  rewritten  as 


m  . 

E  4( 

1=1  Cj 


Flj («) 

A-Aij 


F2 j 
A->2j 


}eAtdA  +4{ 
C' 


E^e) 

Pl(A,e) 


E2(e) 

- 

P2(A^e) 


(2.67) 

where  Cj  are  contours  enclosing  eigenvalues  A^j  and  *2j,  j=l,...,m  and 
C'  is  a  contour  enclosing  all  A^,  Ajjr  j=nw-l  ,...fN.  Since 
RefA^e)) — >Re(Aij(0))<0,  i=l,2,  j=m+l,...,N,  C'  can  be  chosen  to  lie 
entirely  in  the  left  half  plane.  Therefore,  since 


E]_  (c)  E2  (e) 
Pl(A,c)  p2(x,e) 


— >  0,  e — >0  (Condition  1) 


Pl(A,e) 

C' 


E2(e) 

P2  (A,g) 


eAtdA 


c — >0 


E1(e)  E2(e) 

Pl(x,e)  p2(A,e) 


dA 


— >  0,  c— >0 


(2.68) 


Now  consider  only  one  term  in  the  first  part  of  (2.67) : 


cj  *-*lj 


■}  e  ’-dX 


(2.69) 


Let  X'=Ver^.  Then  (2.69)  can  be  written  as 


4  t-JsL 


C'j  cW A'-A1;j  Cr^V-X2j 


F 

t-* - }  exp{cr^A't}dVer^ 


-  *  1-2*. 


Cj  X'-X'ij  V-V2j 


-}  expfx'cldX1 


(2.70) 


where 


*'ij  -X^/c1*,  z=erit,  i«l,2. 


From  Conditions  2  and  3  and  the  Hurwitz  assumption  we  have 


0>Re(A,ij)  =  0(1), 

Re(A'lj-A'2j)  “  od)#  Wij-A^j)  *  o(l)  (2.71) 

Therefore  it  is  obvious  that  C'j  can  be  taken  as  a  circle  in  the  left 
half  plane  whose  diameter  is  of  order  1.  Since  the  order  of  the 
imaginary  part  may  be  lower  or  higher  than  that  of  the  real  part,  this 
circle  may  move  up  or  down  vertically  as  e  changes  but  it  remains  in  the 
left-hand  plane  and  does  not  shrink  to  0.  Also 


d’”A2j ' )Fij“(A'-Xij ' )F2j 


(2.72) 


On  contour  Cj',  X^j' — >  X2j,f  F2j  — Flj  as  e — ^  anc*  fc^e  denominator 
is  bounded  away  from  0.  Therefore  the  integrand  of  (2.70)  goes  to  zero 
uniformly  on  C'j  as  e — >0.  Considering  (2.68)  we  can  conclude  that 
(2.67),  and  therefore  (2.66),  goes  to  zero  uniformly  in  t  as  e — >0. 

This  Lemma  gives  us  a  criterion  to  judge  how  "close"  the 
eigenvalues  of  two  matrices  should  be  in  order  for  one  to  be  a  good 
approximation  of  the  other.  In  the  next  Lemma  we  study  how  a 
perturbation  on  a  matrix  can  influence  the  eigenvalues.  To  develop  a 
complete  picture  of  how  perturbations  affect  the  eigenvalues  is  beyond 
the  scope  of  this  development.  For  our  purpose  of  defining  a  stopping 
point  for  our  iterative  computation  of  G-jJe)  and  G2(e),  we  only  need  the 
following  result. 


Lems  2»-8.3  : 

Let  A=B+cmC  where  A,B  and  C  are  NxN  matrix  functions  of  e.  Let  the 
eigenvalues  of  A  and  B  be  denoted  by  >^(A)  and  a^(b),  i=l,...,N.  Then 


min  0(X^  (a)-A^(b)  )  2  Vp  2  n/N 
i 

where  p  is  some  integer,  lip<N. 


(2.73) 


Combining  this  Lenina  with  (2.5 9) — (2.64)  it  is  clear  that  after  the 
(i+j+l)-th  iteration  the  eigenvalue  difference  between  the  (i+j)-th  and 
the  (i+j+l)-th  iterations  must  have  an  order  higher  than  (i+j)/N. 
Therefore,  as  the  number  of  iterations  becomes  larger  and  larger,  the 
contribution  of  each  additional  iteration  becomes  less  and  less.  Thus, 
a  stopping  rule  can  be  established  if  we  in  addition  take  Lemma  2.8.2 
into  account.  This  result  can  be  sunmarized  in  the  following  Theorem. 


Theorem  2.8.1: 

Let 


"A0(°)(e)  B0(0)(e)' 

r  o 

A0i°>(e)  B0(°)(e)' 

A(e)  = 

eCo(0)(e)  eD0(°)(e) 

—  — 

0  el 

C0(0)(e)  Do(0)(e) 

=  D(e)S(e) 

satisfy  the  MSSNS  condition.  Define  ,P2^ *Ai^  (c)  and  Di^  (e) 
as  in  (2.56), (2.58) -(2.61).  Suppose  the  maximum  order  of  the  real  parts 
of  the  eigenvalues  of  A(e)  is  m.  Then 


lim  sup  II  exp{A(e)t}  -  TiexpiAiteJtjTi""^ 
e->0  t20 


where 


=  0 


(2.74) 


C 

0  eD  (e) 


(2.75) 


and  p+q=mN+l,  where  N  is  the  dimension  of  A(e)r  p>0,q£0  and  is  a 
constant  matrix  defined  in  the  proof. 

EL.QS£: 

For  simplicity  we  will  write  Aq^(O),  Bq^(O),  Cq^)(0),  and 
Dq^(O)  as  A,B,C,  and  D,  respectively.  Frcsn  Section  2.5  we  see  that 

Gi(e)  0 

A(e)  =  T(e)  T^e)  (2.76) 

0  eG2  (e) 

and  G-^OJs-A,  G2 (0)  =D-CA~1B.  It  is  also  clear  fron  Eq.  (2.5  9) -(2.61)  that 
Aj^)  (0)  -  A,  Dj^fO)  =  IX^B,  i+j2l 

Therefore 

Ag(P)  (0)  =G1(0),  DqW(0)=G2(0) 

and  these  matrices  have  SSNS.  Furthermore,  from  Lemma  2.8.1  we  know 
that 

Ai(j)_>Glr  D^)—  >g2,  i+j— ><» 

On  the  other  hand,  one  more  iteration  than  p+q=mN+l  will  change  the 
eigenvalues  of  Ag^P^  on  the  order  of  (mN+l)/N=m+l/N>m.  Since  the 
highest  order  of  the  real  parts  of  eigenvalues  is  m,  we  see  that  this 
implies  that  Condition  2  of  Lemma  2.8.2  is  satisfied.  Furthermore, 
Condition  3  is  automatically  satisfied  if  r^^s^,  since  as  we  just 


argued,  the  order  change  introduced  by  one  more  iteration  will  be 
higher  than  rj  and  therefore  greater  than  Sj.  From  Lemma  2.8.2,  we 
therefore  have 

lim  sup  II  expfA-j^t}  -  expfGit}  1 1  =  0  (2.77) 
e->0  t>0  H 

A  similar  equation  holds  for  and  G2. 

Now  define  the  product  of  the  iteratively  applied  and  P2^ 
to  be  T^-^te)  which  is  obviously  unimodular  and  therefore  contains  no 
information  about  the  time  scales.  It  is  also  easy  to  check  that 
T1(0)=T(0),  where  T(e)  is  defined  in  (2.76).  Thus,  we  finally  have 
(2.74)  with  T-^T^O). 


Note: 


1.  Since  A(e)  is  Hurwitz  for  eQ>e>0,  the  real  parts  of  its 
eigenvalues  cannot  be  zero  and  are  in  fact  negative.  Therefore  m  is 
finite.  In  fact;,  let  the  order  of  the  determinant  of  A(c)  be  0^.  Then 

0£p{\\\),  i*l,...,N 

But  I  A^|2=(ReAjj2+(im\^)2.  Therefore  it  is  easy  to  see  that 

liO(Re\^),  i=l,...,N.  In  other  words,  0^  is  an  upper  bound  on  m. 


2.  The  integers  p  and  q  in  (2.75)  are  not  unique  but  their  sum  — 


the  number  of  total  iterations  must  be  sufficiently  large.  It  is  quite 


possible  that  the  iteration  can  stop  much  earlier.  Nevertheless, 
pfq=NO^  is  a  safe  bound  for  stopping. 

3.  In  Theorem  2.8.1  only  the  first  step  of  the  multiple  time  scale 
decomposition  has  been  shown.  But  the  general  procedure  is  only  a 
simple  extension  of  this  theorem.  The  detailed  procedure  is  as  follows. 


(1)  Determine  the  Smith  form  of  A(e)=P(e)D(e)Q(e) . 

(2)  Let  D(e)Q(c)P(e)=D(e)S(e)  where 


Me)  = 


A11 

**21 


eA' 


u2 


22 


eA 


In 


2n 


^Anl  60  lftn2 . ^'Xn 


(3)  Compute  the  order  of  the  determinant  of  A(e)  and  use  it  as  an 
upper  bound  on  m. 


(4)  Similar  to  the  procedure  for  deriving  the  multiple  time  scale 
decomposition  in  Chapter  2,  we  treat  A-q  and  the  complement  lower  right 
principal  submatrix  as  Aq^  and  Dg^  in  Theorem  2.8.1  and  apply 
pKpND^+1  iterations  to  them. 

(5)  Treat  the  resulting  as  the  original  matrix  A(e)  and  go 
back  to  step  (3). 


(6)  Repeat  this  procedure  until  all  time  scales  have  been  revealed. 


4.  The  result  in  the  previous  sections  of  this  Chapter  can  be 
viewed  as  a  special  case  of  the  theorem  in  this  section.  That  is,  if 
A(e)  satisfies  the  MSST  condition,  then  for  each  time  scale,  only  one 
iteration  is  required.  But  if  we  do  not  know  if  the  system  satisfies 
MSST,  we  have  to  do  the  following.  First  we  apply  the  procedure 
described  in  the  preceding  section,  which  corresponds  to  performing 
only  one  step  of  the  iteration  described  in  this  section  at  each  time 
scale.  If  for  all  time  scales  the  resulting  deagonal  blocks  (i.e.  Aj^ 
as  we  denoted  before)  are  Hurwitz  at  e=0,  then  we  are  done.  Otherwise, 
if  at  some  step  i,  the  resulting  Aj^  is  not  Hurwitz  at  e=0,  we  have  to 
go  back  to  the  very  beginning  and  perform  the  full  set  of  iterations  at 
each  time  scale. 


2.8.4  Invariant  Factor  Assignment  When  A(e)  And  B(e)  Are  Left 
Coprime 


In  this  subsection  we  solve  the  problem  of  invariant  factor 
assignment  by  state  feedback  when  A(e)  and  B(e)  are  left  coprime  over 
the  ring  T.  To  prove  the  main  results  we  need  the  following  Lemma. 


Lenma  2.8.4 


Let  A(e),  B(e)  and  K(e)  be  nxn,  nxl  and  lxn  matrices  over  T  and 


A(e)  ~  diagla-,,...^]  =  SA 


where  a1,....,an  are  invariant  factors  of  A(e)  and  denotes 
equivalence  via  multiplication  (on  the  right  and  left)  by  unimodular 
transformations.  Let 

F(e)  =  A(e)  +  B(e)K(e)  (2.78) 

Then  for  any  K(e)  €  ,^axn  there  is  a  R(e)  *  I™301  such  that 
F(c)  ~  SA  +  5(e)R(e) 

where  5(e)  is  an  upper  triangular  matrix  and  B(e)  ~  5(e).  Furthermore 
there  exist  unimodular  matrices  R  and  S  such  that 

K  =  RRS 

We  know  that  there  exist  unimodular  matrices  P  and  Q  such  that 


A  =  PSaQ 

Therefore 

A  +  BK  ■  PSaQ  +  BK  ■  P(SA  +  P_1BKQ'1)Q 


and 


A  +  BK  ~ S*  +  P-1BKQ-1  =  W 


(2.79) 


Furthermore,  by  elementary  column  operations  we  can  show  that 


P-1BU  =  5 


where  U  is  a  unimodular  matrix  and  5  is  an  upper  triangular  matrix, 


Therefore  (2.79)  can  be  rewritten  as 


W  =  SA  +  Btf^KQ"1 


We  have 


K  =  LT1KQ-1 


W  =  SA  +  BR 


and  the  lenxna  is  proved. 


What  Lemma  2.8.4  tells  us  is  that  instead  of  considering  the 
invariant  factor  assignment  of  F(e)  we  may  work  on  SA+B(e)R(e).  Because 
of  the  special  form  of  SA  and  6(e),  the  original  problem  becomes  much 
easier.  We  shall  see  this  in  the  proof  of  our  main  result  Theorem 
2.8.2. 

Theorem  2.8.2 

Let  A(e)  £  T11*11,  B(e)  €  Tnxr,  and  let  b  denote  the  rank  of  B(0). 
Assume  A(e)  and  B(e)  are  left  coprime.  Then 


1.  F(e)  defined  in  (2.78)  can  have  no  more  than  b  non-unit 
invariant  factors  for  any  choice  of  K(e). 

2.  There  exists  a  K(e)  such  that  F(e)  has  eT1,....,e^b  as 
its  invariant  factors,  for  arbitrary  non-negative  integers 
3 1  f  ••••#  jb  (with  the  convention  chat  e*°=0). 


1.  The  left  copnmeness  of  two  matrices  A(e)  and  B(e)  over  the 
local  ring  T  means  that  if  there  are  matrices  D(c)  T*1301,  £(e)  T0301  and 

B(c)  6  T11*  such  that 

A(e)  ■  D(e)X(e),  B(e)  =  D(e)B(e)  (2.80) 

then  D(e)  is  unimodular.  That  is  D(0)  is  nonsingular.  A  simple 
criterian  for  the  left  ooprimeness  of  A(e)  and  B(e)  is  to  check  if  [A(0) 
B(0)  ]  is  of  full  row  rank.  R>r  example, 


are  not  because  there  exists  a  nonun imodular  matrix 


and  matrices 


1  0 

1  0 

Me)  * 

t 

B(e)  = 

0  1 

e  1 

satisfying  (2.80).  Alternatively,  we  note  that 


[A  (0) 


B(0)  ]  = 


1 

1 


0  10 
0  10 


is  not  full  rank. 


2.  It  is  also  easy  to  see  that  in  order  to  ensure  the  full  rank  of 
[A(0)  B(0)  ] ,  the  rank  of  A(0)  must  be  greater  than  n-b.  Or  in  other 

words,  A(e)  itself  has  at  most  b  invariant  factors  other  than  1. 


£CfiS£: 

As  in  Lama  2.8.4,  let 
A  =  PSaQ,  B  =  P-1BU 

where  B  is  an  upper  triangular  matrix.  Consider 
W  ■  SA  +  BK 


Suppose  A  has  n-m  unit  invariant  factors. 


Then  in  view  of  left  coprimeness  of  A  and  B,  B  must  have  the  form  of 


where  x  represents  and  arbitrary  element  in  T. 


First,  let  us  assume  that  m=b.  Then  we  can  construct  a  unimodular 
matrix  V  such  that 


Suppose  gp— 9®  are  the  desired  closed-loop  invariant  factors.  Let 


K  *  V 


0  0 
L  0  G 


J 


where  *  diaglgi-€il,...,gir|-e*ln] .  Then 


SA  +  ER 


I  w 
o  g2 


where  G2  =  diag  [gi,...,gm]  and  W  is  some  (r-m)xm  matrix.  After  some 
elementary  operation,  we  can  show  that 


SA  +  BK  ~diag[l,..,l,g1,...,gm] 


By  Lemma  2.8.4  we  know  that  A+BK  has  invariant  factors  g^,...^  where 


K  =  UKQ 

Now,  assume  m<b.  Then  after  some  elementary  column  operations  we 
can  always  achieve 


r 


X  . *•*•••  X 

0...0  l  o . o 


0..0  1  0 . 0 

X  . X 


n-m 


m 


Namely  there  will  be  b-m  rows  (say,  rows  k^,...^^^  among  the  first  n- 
m  rows,  each  of  which  has  one  unit  and  r-1  zeros.  Furthermore  if  the 
unit  in  kj-th  row  is  located  in  the  Sj-th  column,  then  Sj^Sp,  if  j^p. 
That  is,  the  units  for  different  rows  are  located  in  different  columns. 


If  gir...,g5  are  the  desired  invariant  factors,  we  can  construct  a 
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R  in  a  similar  manner  to  that  used  previously: 


where 


K1 


kl  k2 

0  0 . 0....0" 

0  0 . 0....0 

0  0....g2-1..0  j2 

0  gi-l  ..0....0  ji 

0  0  . 0 


and 

G3  =  diag  I  gb-m+i-® 11  - - 


similarly  we  have 


SA  +  BK  = 


where 


lx  X  .  X 


0....0  g1  0 . 0 

X....X  X  1  X...X 


0 


0  g^J 


G4  =  diag[^I)_nH.^f....fgj:)] 


and  K3  is  some  arbitriry  matrix.  After  some  column  operations  we  have 


AD-A162  896 


UNCLASSIFIED 


AN  ALGEBRAIC  APPROACH  TO  TINE  SCALE  ANALVSIS  AND  2/ 

CONTROL(U)  HASSACHUSETTS  INST  OF  TECH  CAHBRIDGE  LAB  FOR 
INFORHATION  AND  D.  .  X  LOU  OCT  85  LIDS-TH-1585 
RFOSR-TR-85-1129  DAAG29-84-K-8885  F/G  12/1  NL 


SA  'vdiag[l, • . .  ,l,g^,l, . . . ,  If  g2*  *  *  *  #9b^ 


By  Lemma  2.8.4  we  know  A+BK  has  invariant  factors  gp...^.  This 
proves  the  second  part  of  Theorem  2.8.2. 


Let  N  be  the  number  of  unit  invariant  factors  of  A.  It  is  easy  to 
see,  that 


N  =  rank [A(0)+B(0)K(0) ] 


Using  Lemma  2.8.4  this  is  equivalent  to 


N  «  rank[SA(0)+BR(0)] 


And  as  pointed  out  before 


SA(0)  +  B(0)R(0)  *  diag[l,...,l,0,...,0] 


x  . x 

+  x  .  x  R(0) 


■ .  1 


where  x  represents  either  1  or  0.  Since  the  rank  of  B{0)  is  b£m,  there 
must  exist  b-m  independent  column  vectors  in  the  first  r-m  columns  in 
B(0).  Since  left  multiplication  by  R(0)  is  nothing  more  than  column 
operations  and  the  rank  of  SA(0)  is  n-m,  adding  B(0)R(0)  to  SA(0)  can 
reduce  the  rank  of  SA(0)  by  at  most  b-m.  In  other  words,  the  rank  of 
SA(0)+B(0)R(0)  is  greater  than  n-m- (b-m)  =n-b.  Or 


N  >  n-b 


which  proves  part  1  of  the  theorem. 


(Note:  There  is  no  page  98) 


i  ■  •/  %V'  .*•  /aifrV*  •'*  «'* 
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CHAPTER  3 


EIGENVALUE  AND  INVARIANT  FACTOR  STRUCTURES 


3.1  INTRODUCTION 


So  far  we  have  shown  that  the  MSSNS  of  a  matrix  A(e)  implies  that 
the  Schur  complements  are  non-singular +  and  MSST  implies  that  ^11 
are  Hurwitz.  In  this  chapter  we  shall  introduce  new  but  equivalent 
criteria  for  the  MSSNS  and  MSST  conditions.  These  criteria  are 
important  not  only  because  they  provide  new  insight  into  these 
conditions  and  their  relationship  to  the  eigenvalues  of  A(e),  but  also 
because  they  make  it  possible  to  introduce  a  new  approach,  developed  in 
Chapter  4f  for  defining  time  and  amplitude  scaling  when  the  MSSNS 
condition  is  not  satisfied  so  that  the  traditional  time-scale 


+  As  in  most  of  our  development  in  Chapter  2,  we  assume  here  that  A(e) 
is  invertible  and  in  fact  Hurwitz  for  e  e(0,eQ].  The  extension  ofthe 
results  of  this  chapter  to  the  case  when  this  is  not  true  cah  be 
accomplished  in  a  straightforward  manner,  such  as  in  Section  2.8. 


decomposition  method  cannot  be  applied. 


First,  in  Section  3.2,  we  show  several  equivalent  conditions  for 
MSSNS.  In  particular,  it  is  proven  in  Theorem  3.1  that  the  MSSNS 
condition  is  not  only  equivalent  to  being  non-singular  as  seen  in 
last  chapter,  but  also  equivalent  to  1)  the  eigenvalues  and  invariant 
factors  of  A(e)  having  the  same  orders  and  to  2)  a  "no-cancellation" 
condition,  namely  that  for  a  certain  set  of  values  of  i  the  order  of  the 
gcd  of  all  ixi  principal  minors  must  be  the  same  as  that  of  their  sum, 
together  with  the  condition  that  this  set  of  the  gcd's  uniquely 
determines  the  invariant  factors  in  a  particular  fashion.  The 
plausibility  of  the  first  result  can  be  seen  from  the  following 
observation.  Suppose  A(e)  not  only  has  MSSNS  but  also  has  MSST.  Then 
it  has  been  shown  in  last  chapter  that  the  time  scales  are  determined  by 
its  invariant  factors.  On  the  other  hand,  the  eigenvalues  of  A(e) 
determine  its  modes.  Therefore  we  would  expect  that  if  the  system  has 
well  behaved  time-scales,  i.e.  A(e)  has  MSST,  the  eigenvalues  and 
invariant  factors  of  A(e)  should  have  the  same  orders.  Since  the  MSST 
condition  differs  from  MSSNS  condition  only  by  adding  an  extra  stability 
conditions  (  which  we  shall  touch  on  in  a  moment)  MSSNS  should  also 
ensure  this  purely  algebraic  equality.  It  is  shown  in  Theorem  3.1  that 
the  equality  of  the  orders  of  the  eigenvalues  and  invariant  factors  is 
not  only  a  necessary  condition  for  A(e)  to  have  MSSNS,  as  we  just 
argued,  it  is  also  a  sufficient  condition.  To  illustrate  these  ideas, 
let  us  consider  two  examples.  The  first  one  does  not  have  MSSNS  but  the 


'm 


second  one  does, 


Example  3.1.1 

In  this  example  we  consider  a  system  that  does  not  have  MSSNS.  As 
we  will  see,  its  eigenvalues  and  invariant  factors  do  not  have  the  same 
orders  either.  The  system  considered  is 


"l 

X1 

— €  1 

X1 

*2 

0  “ € 

x2 

This  system  does  not  have  well-defined  time  scale  behavior  which  can  be 
seen  directly  fran  the  solution  of  this  system. 

x-jjt)  *  x10e_et  +  x2ote“et 
*2^  *  x20e_et 

If  we  substitute  c=et,  we  have 

x^t/e)  =  x10e-c  +  x20t/ce""C 
x2(t/e)  =  x20e_t 

It  is  clear  that  at  the  time  scale  t=et,  the  second  term  in  the  first 
equation  becomes  unbounded  as  e  goes  to  zero.  As  pointed  out  in  [7], 
this  is  an  evidence  that  the  system  does  not  have  well-defined  time 
scale  behavior.  It  is  easy  to  see  that  this  system  does  not  have  MSSNS 
either.  Furthermore,  the  eigenvalues  of  A^e)  are  e  and  e.  On  the 
other  hand,  the  Smith  form  of  Ai  (e)  is 


Therefore  the  orders  of  eigenvalues  are  1  and  1  and  the  orders  of  the 
invariant  factors  are  0  and  2. 


Example  3.1.2 

Now  consider  the  system  matrix 


We  already  knew  that  this  system  has  well-behaved  time-scales.  It  can 
be  seen  that  the  orders  of  the  eigenvalues  and  the  invariant  factors  are 
the  same,  i.e.  0  and  1. 

Based  on  the  proof  of  this  result  we  shall  observe  that  under  MSSNS 
the  eigenvalues  of  A(e)  are  clustered  in  n  groups  with  those  in  the  k-th 
group  close  to  the  eigenvalues  of  This  block  diagonal  dominant 
phenomenon  is  expected  in  view  of  the  appoximation  of  exp{A(e)t}  by 
TexpfAfeJtJT"1  if  the  system  satisfies  MSST.  It  is  also  true  when  MSSNS 


condition  is  satisfied.  This  observation  is  sunmarized  in  Theorem  3.2. 


The  second  result  in  Theorem  3.1  first  says  A(e)  satisfies  MSSNS  if 
and  only  if  the  set  of  gcd's  (greatest  common  divisors)  of  all  ixi 
principal  minors,  i=l,...,n  uniquely  determine  the  invariant  factors  and 
in  addition  we  have  the  no-cancellation  condition  mentioned  earlier. 
This  is  different  from  the  general  case  where  the  orders  of  all  ixi 
minors,  instead  of  principal  minors,  determine  the  invariant  factors. 
An  algorithm  is  also  given  to  actually  compute  the  invariant  factors. 
This  gives  us  another  potentially  very  useful  criterion  for  checking  the 
MSSNS  condition. 

In  Section  3.3,  we  present  Theorem  3.3,  which  defines  conditions 
equivalent  to  MSST.  They  include  two  known  conditions,  i.e.  1)  the  A^ 
being  Hurwitz  and  2)  the  system  having  well-behaved  time-scales,  and  3) 
a  new  condition  which  reveals  the  connection  between  the  MSSNS  and  MSST 
conditions  and  the  eigenvalues  of  A(e).  Specifically,  if  A(e)  satisfies 
MSSNS  and  the  orders  of  the  real  parts  of  its  eigenvalues  are  egual  to 
or  less  than  those  of  the  corresponding  imaginary  parts  then  A(e)  has 
MSST  and  vice  versa.*  This  condition  implies  that  in  addition  to  MSSNS, 
if  the  damping  rates  of  the  system  modes  are  equal  to  or  faster  than  the 
oscillation  rates,  then  well-behaved  time-scales  are  ensured  and  vice 
versa.  For  example,  consider  the  following  system, 

+  Because  of  our  assumption  that  A(e)  is  Hurwitz  for  e*(0,e0], 
we  knew  that  the  real  parts  of  the  eigenvalues  are  negative. 


where  A(e)  is  stable  and  satisfies  MSSNS  since 


1 

0 


which  obviously  is  invertible.  Note,  however  that  a^  =  -e+1.  Therefore 
the  damping  rate  here  is  of  order  1  but  the  oscillation  rate  is  of  order 
0.  Thus  the  condition  we  have  just  stated  is  violated  so  that  the 
system  does  not  have  well-behaved  time-scales.  Indeed,  solving  (3.1)  we 
have 


x^t)  =  x10e-etsint 
x2(t)  =  X2oe_etcost 
Substituting  c=crt  we  have 

Xi(t)  *  x^oe”csint/e 
x2(c)  =  x20e"ccosr/e 


Thus  the  time  behavior  of  x(t)  as  e — >0  shows  oscillation  but  no 
damping,  while  the  process  x(t)  does  not  have  a  limit  as  e — >0 


because  of  rapid  oscillations.  Note,  from  the  fact  that  A^  is  not 
Hurwitz  we  can  come  to  the  same  conclusion. 

In  Section  3.4,  we  present  an  algorithm  for  checking  MSSNS  based  on 
the  results  of  Section  3.2.  Finally,  in  Section  3.5  we  summarize  our 
results  and  present  an  additional  result  dealing  with  MSSNS  and  the 
detailed  eigenstructure  of  A(e). 


3.2  Equivalent  Statements  of  the  MSSNS  and  MSST  conditions 

In  this  Section  we  prove  several  statements  which  are  equivalent  to 
MSSNS  condition.  We  have  already  shown  in  Chapter  2.  that  the  MSSNS 
condition  is  equivalent  to  A^  being  non-singular.  In  this  section  we 
shall  prove  two  new  equivalent  conditions  —  1)  the  orders  of  the 
eigenvalues  of  A(e)  being  equal  to  the  orders  of  the  corresponding 
invariant  factors  and  2)  the  orders  of  the  gcd's  of  the  ixi  principal 
minors  of  A (e),  i=l,...,n,  uniquely  determining  the  invariant  factors 
and  certain  of  these  orders  being  equal  to  the  orders  of  the  sums  of  the 
corresponding  principal  minors.  As  mentioned  in  Section  3.1,  the 
intuition  behind  the  first  result  is  based  on  the  following 
observations.  We  have  shown  that  the  invariant  factors  of  A(e) 
determine  the  time-scale  structure  of  the  system  if  the  system  has  well- 
behaved  time-scale  structure.  On  the  other  hand,  the  eigenvalues 
determine  the  modes  of  a  system.  Therefore  if  the  system  has  well- 
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behaved  time-scale  behavior,  the  invariant  factors  and  eigenvalues 
should  have  the  same  orders.  In  this  section  we  prove  this  is  true  if 
and  only  if  the  system  has  MSSNS. 


The  second  result  is  closely  related  to  the  first  one.  We  shall 
see  that  the  orders  of  the  sums  of  the  principal  minors  uniquely 
determine  the  orders  of  the  eigenvalues.  Thus  if  these  orders  are  equal 
to  the  gcd's  of  the  principal  minors,  in  other  words  if  there  is  no 
complete  cancellation  of  the  lowest  order  terms  in  the  summation,  then 
the  latter  will  determine  the  orders  of  the  eigenvalues  too.  Therefore 
from  the  first  result  we  see  that  if  in  addition  the  gcd's  of  the 
principal  minors  also  uniquely  determine  the  orders  of  the  invariant 
factors  then  A(e)  must  have  MSSNS  because  its  eigenvalues  and  invariant 
factors  must  have  the  same  orders.  We  show  the  first  result  in  section 
3.2.1.  Then  in  section  3.2.2  we  prove  the  second  result  and  state  the 
entire  theorem. 


3.2.1  Orders  of  eigenvalues  and  invariant  factors 


To  show  the  first  result  we  need  the  following  lemma. 


[•] 


Then,  if  Alx  is  invertible 

detA  =  detA^2^e^22 

where 

a22  =  a22"A21A11~1a12 
Proof:  see  [  ]. 

Consider  a  matrix  A(e)  over  the  ring  T.  Denote  the  orders  ofthe 
eigenvalues  and  invariant  factors  by  b^b2^...».l,bn  and  £an 

respectively  and  the  corresponding  eigenvalues  are  Ap„„rAn.  The 
Smith  form  of  A(e)  is 


A(e)  =  P(e)D(e)Q(e) 

Let 

A1(e)  =  P~1{e)A(e)P(e)  =  D(e)Q(e)P(e)  =  D(e)X(e) 


where  I0f..../In_i  are  n0xn0/  n1xn1,....,nn_1xnn_]_  identity  matrices. 
Let  A^,  i=l,....,n  denote  the  successive  Schur  complements  of  A(0)  as 
described  in  Section  2.  Then  we  state  the  first  result  as  the 
following  lemma. 


Lemma  3.2 

A(e)  has  MSSNS  if  and  only  if  The  eigenvalues  and  invariant  factors 
have  the  same  orders;  i.e.  =  a^,  i=l,....,N. 


Proof 


Since  A^e)  is  similar  to  A(e),  they  have  the  same  eigenvalues. 
Furthermore,  because  P(e)  is  unimodular,  it  will  not  effect  the 
invariant  factors.  Therefore  we  shall  consider  A^e)  instead  of  A(e)  in 
our  proof.  We  shall  use  the  fact  that  A(e)  having  MSSNS  is  equivalent 
to  the  A.^,  i=l,...,n  having  full  rank,  a  fact  proven  in  last  section. 

We  first  prove  the  "only  if"  part. 

Define 

d(e,A)  =  det(e“^Ai(e)  -  *i) 


M 


Denote  d(e,A)  and  d-^e,*)  by  d  and  d^  respectively.  Then  the  difference 
between  them  is  that  in  d^  the  first  j-1  blocks  of  rows  of  the  matrix 
have  been  mutiplied  by  respectively. 

Therefore 

dj_  =  evd 

where 

v  *  n0kj+n1{kj-k1)+...+nj_1(kj-kj_1) 

Since  e  is  a  positive  number,  d^  and  d  will  have  the  same  zeros. 


It  is  easy  to  see  that  d^  is  a  continuous  function  of  e  at 


following  inequalities  hold.  Given  any  positive  number  o  there  is  an 


Cj  such  that 


I^Mj+i/®1^  _Ai(Ajj)l  <$  f  i=l,...,nj,  if  0<e<ej 


(3.4) 


where 


j-1 


Mj  =  2  n^  and  is  the 


(Mj+i)-th  eigenvalue  of  AjJe) 


Repeating  this  procedure  for  j=0,....,n-l  we  can  finally  prove  that 
for  any£>0,  there  is  an  en 


e^  —  min  {  Cq  f . .  •  •  t  } 


such  that  if  0<e<en  then 


~  ^i (^jj)  I  ^  ^  r  i=l/ .. .^nj, 
j=0,...,n-l 


(3.5) 


Since  A^(Aj-j)\0,  for  j=0,....,n-l,  i=lr . ,nj,  these  inequalities  tell 

ul  that  there  are  nQ  eigenvalues  of  order  kj_j.  These  are  exactly  the 
orders  of  the  invariant  factors  of  A(e). 

Now  let  us  prove  the  "if"  part. 

Suppose  that  Aj^  is  not  of  full  rank  but  A^,  i=0,...,j-l  are. 
Then  from  (3.4)  we  see  that  there  is  at  least  one  eigenvalue  of  Ajj,  say 
*i0(Ajj)*°  and  the  corresponding  eigenvalue  of  A^e),  Ai0,  has  an  order 


higher  than  kj.  This  implies  that  there  are  fewer  eigenvalues  with  the 
order  kj  than  there  are  invariant  factors. 

Now,  let  us  show  the  second  result  in  Theorem  3.1  which  relates  the 
invariant  factors  with  MSSNS  condition. 


3.2.2  The  Invariant  Factors  and  Principal  Minors  of  A(e) 

As  mentioned  in  3.2.1,  in  this  subsection  we  investigate  some 
properties  relating  the  MSSNS  condition  with  the  invariant  factors  and 
principal  minors  of  A(e).  We  prove  in  Lemma  3.4,  that  if  A(e)  satisfies 
the  MSSNS  condition  then  the  invariant  factors  will  be  uniquely 
determined  by  the  orders  of  the  gcd's  of  the  principal  minors  and  these 
orders  must  be  equal  to  the  orders  of  the  sums  of  the  principal  minors. 
In  Theorem  3.1  we  show  that  the  reverse  statement  is  also  true.  Also  a 
corollary  of  Theorem  3.1  shows  that  if  the  gcd  of  the  principal  minors 
are  computed  from  the  explicit  form  of  A(e)  then  the  following  statement 
is  true:  if  the  orders  of  the  gcd  of  the  principal  minors  uniquely 
determine  the  invariant  factors  then  A(e)  has  MSSNS.  That  is,  in 
explicit  form  the  no-cancellation  condition  is  automatically  satisfied 
if  the  invariant  factors  are  uniquely  determined  by  the  orders  of  the 
gcd's  of  the  principal  minors. 

It  is  well  known  that  the  eigenvalues  of  a  matrix  are  determined  by 
its  principal  minors  of  all  sizes.  Then  it  is  obvious  from  the  previous 
subsection  that  under  MSSNS  the  principal  minors  also  determine  the 
invariant  factors  of  A(e).  For  a  general  matrix  A(c)  (possibly  not 
possessing  MSSNS)  the  invariant  factors  can  be  determined  from  the  gcd's 
of  all  ixi  minors,  i=l,...,n.  We  shall  prove  that  under  MSSNS,  the  acd 
of  all  ixi  principal  minors,  i=l,....,n,  determine  the  invariant 
factors. 
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To  show  this,  we  first  prove  Lemma  3.3.  Then  we  show  that  the 
orders  of  the  god's  of  ixi  principal  minors  and  the  orders  of  i-th 
invariant  factors  satisfy  the  conditions  of  Lemma  3.3.  Based  on  this 
fact  we  prove  the  result. 

Now  let  us  first  state  and  prove  Lenma  3.3. 


LEMMA  3.3. 


Given  a  set  of  real  numbers  p^,  i=l,...,n,  there  is  a  unique  set  of 
real  nubers  s^,  i=l,...,n  such  that 

El  s^s^....^ 


F2 


3 


r 

i=l 


H  £  Pj  r 


j=l, . . . ,n 


E3 


if  SjHsj+i  or  j=n. 


Proof; 

This  result  is  most  easily  seen  pictorially.  In  Fig.  3.2.1,  the 


x's  denote  the  p^'s.  First,  we  draw  a  line  starting  from  origin  and 
passing  through  at  least  one  Pj_  and  leaving  all  p^'s  either  on  or  above 
this  line.  Obviously  there  is  one  and  only  one  line 


Figure  3. 1 


which  satisfies  this  requirement.  Let  pni  be  the  last  of  the  pj  that 
lie  on  this  line.  If  Pni^Pn  the  procedure  is  terminated.  If  not,  then 
starting  from  p^  we  draw  the  second  line  which  passes  through  at  least 
one  of  the  remaining  p^s  (nH^n^  and  leaves  the  other  Pj/s  either  on 
or  above  it.  Let  pn2  denote  the  last  point  through  which  this  second 
line  passes.  The  procedure  is  continuned  until  some  line  created  passes 
pn.  We  claim  that  the  slope  of  these  segr  mts  are  the  s^'s.  Namely, 

Si=S2=...=Sni=Pnl/nl'  snl+l=*— =sn2=(Pn2"Pnl)/(n2~nl)'  etc*  The  P°ints 
Pnl'  Ph2'  etc*  wil1  ^  turning  points. 

We  first  prove  that  this  is  one  solution.  First,  the  slope  of  each 
succesive  line  segment  is  always  larger  than  the  previous  one  because 
otherwise  the  previous  one  must  leave  sane  points  below  it.  This  proves 
condition  FI.  Condition  F2  follows  from  the  fact  that  all  p^'s  are 
either  on  or  above  these  segments.  Conditon  F3  holds  because,  for 
example,  the  segments  opn]_  and  PniPn2  must  have  different  slopes  by 


definition  of  these  segments. 


To  show  the  uniqueness,  suppose  we  have  another  solution 


n.  Let 


1 

2  s' 
j=l 


rri* 


Obviously,  t^ip^.  Otherwise 


F2  will  be  violated.  Suppose  rnl<pnl.  Since  Pi»*wPni  ate  either  on 
or  above  op^  by  definition,  the  line  or^  must  be  a  straight  line  too. 
If  this  were  not  the  case,  say,  at  some  point  nt  the  slope  changed,  then 
pnt  would  be  on  the  line  opnt  because  of  F3  and  pnt  would  be  oelow  op^, 
contradicting  the  construction  of  this  line  (see  Fig.  3.2.2).  In  a 
similar  fashion  we  can  deduce  that  the  curve  0-r1“r2'*».-rIn  cannot 
change  slope  at  any  point.  But  we  know  that  our  curve  O-p^-p^...  meet 
at  the  end  since  rn=pn  by  condition  F3.  This  contradicts  the  hypotheses 
that  rni<Pni*  The  same  argument  works  if  rnisPni  i=l,...,k-l  but 
rnk<Pnk* 


Figure  3.2  :  Illustrating  a  contradiction: 

If  rrj_<pn2  and  if  the  postulated  curve  from  0  to  r^  changes 
slope,  then  necessarily  pnt<pni,  contradicting  the 
construction  of  this  line 


Note:  It  should  be  pointed  out  that  the  algorithm  proposed  in  this 
section  is  actually  a  "Newton  polygon"  concept.  This  is  to  be  expected 
in  the  context  of  the  present  problem,  c.f.[19].  However,  we  have  not 
encountered  in  the  literature  a  statement  as  simple  as  that  given  in  the 
Lemma  or  to  be  given  in  statement  4  of  that  of  Theorem  3.1. 

Now  we  turn  to  the  relation  between  gcd's  of  principal  minors  and 
invariant  factors.  We  shall  show  in  the  following  Lemma  that  their 
orders  satisfy  F1-F3  in  Lemma  3.3. 

Let  us  first  define: 

p^  =  the  order  of  the  gcd  of  all  ixi  principal  minors. 

a^  =  the  order  of  the  i-th  invariant  factor. 

M^,  i=l,...,  f  j,  m=l,....,n  =  mxm  principal  minors  of  A(e). 

In* 

rm  =  Order (Z  Mi1”), 
i 

Then  we  have 

Lsroa  1A 

If  A(e)  Tnxn  has  MSSNS,  then  p^  and  a^  satisfy  F1-F3  (with  the 
playing  the  role  of  the  Sjh  Furthermore  rrn=pm  if  ^^^+1  (the  "no- 
cance  1 1  ation"  condition) . 

Proof; 

By  definition, 
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(3.6) 


j 

Also  £  a^  is  the  order  of  the  god's  of  all  jxj  minora  Therefore 
i=l 

Pl  and  a^  satisfy  Fl  and  F2.  To  show  F3,  let  us  recall  that  A(e)  having 
MSSNS  also  means  that  the  invariant  factors  of  A(e)  have  the  same  orders 
as  the  corresponding  eigenvalues.  In  other  words, 


”  a^,  i-l,...,n 


(3.7) 


where  are  orders  of  the  eigenvalues,  a^. 


It  is  known  that 


det(*I-A(e))  =  An  -  An-1(£>i)  +  An”2(  £ AAAj)  +....+ 
A]_»  •  •  *An 


=  An  -  An_1(  Z  M^)  +An_2(ZHi2)  +....+ Mxn 

(3.8) 


Where  M^m,  i*l,...,|  |,  m=l,....,n  are  principal  minors  of  A(e) . 


Conpare  the  coefficients  on  both  sides  of  (3.8)  we  see  that 


21  *£  AU....A 


lm 


(3.9) 


Now,  suppose  a^  H  ^+1*  Under  the  MSSNS  condition,  in  view  of  (3.6), 
(3.7),  we  see  that  among  all  the  terms  on  the  right  hand  side  of  (3.9), 


M**^m  ^as  ^e  smallast  degree,  say  d^  and  any  term  in  (3.9)  other  than 
Ai«.Am  niust  have  an  order  greater  than  d^.  Therefore 

m 

rm  =  Order(  Z  M^)  =  d^  =  Z  aA 
1  i=l 

=  order  of  gcd  of  all  mxm  minors  =  (3.10) 

Since  the  order  of  the  gcd  of  (i.e.  pro)  must  be  greater  than  or 
equal  to  (since  is  the  gcd  of  a  smaller  set  of  minors),  this 
equality  in  turn  means  that  is  equal  to  d^  That  is 
m 

?m  ai»  ^  ^  ^  am+l 
1=1 

n 

This,  together  with  the  fact  that  Z  -  order  (M-i n)  yields  F3, 

i=l 

and  (3.10)  gives  us  the  no-cancellation  condition. 

Before  we  state  and  prove  Theorem  3.1,  we  still  need  the  following 
lemma. 

Lemma  The  sets  of  numbers  r^  and  b^  satisfy  Fl  to  F3,  with  the  r^ 
playing  the  role  of  the  p^  and  the  b^  the  role  of  the  s^. 

Proof:  The  conditions  Fl  and  F2  are  obtained  simply  by  cheking  the 

two  sides  of  (3.8).  The  derivation  of  F3  is  similar  to  that  of  (3.10). 

Lemma  !*£  Define  q^  i=l,...,n  to  be  the  orders  of  the  gcd  of  all  ixi 
minors.  Then 


Note:  qj_=a1+....+an  by  definition  of  the  invariant  factors. 

The  proof  is  also  trivial.  Since  the  qj_  are  computed  from  a  larger 
set  of  minors,  the  second  inequality  is  then  proved.  The  first 
inequality  is  due  to  the  fact  that  the  order  of  the  sum  of  some  terms  is 
always  greater  than  or  equal  to  the  order  of  the  gcd  of  those  terms. 

Based  on  lemma  3.1  to  3.6  we  can  prove  the  following  theorem. 

Theorem  1*1 

Suppose  A(c)(Tnxn.  Then  the  following  statements  are  equivalent. 

1.  A(e)  satisfies  the  MSSNS  condition. 

2.  The  A^,  i=l,...,n  are  invertible. 

3.  The  orders  of  the  eigenvalues  are  equal  to  that  of  the 
invariant  factors,  i.  e.  a^=b^. 

4.  (a)  p^  and  a^^  i=l,...,n  satisfy  FI,  F2  and  F3. 

(b)  rm=p^  if  (the  no-cancellation  condition) . 

Proof: 

1  <==>  2:  This  has  been  proven  in  the  last  section. 

2  <=>  3:  This  has  been  proven  in  Lemma  3.2. 

3  ==>  4;  This  is  Lemma  3.4. 

4  «=>  3:  Define  p^c  the  lower  bound  of  the  convex  hull  of  p^. 
Refering  to  Figure  3.2.1,  p^,  i=l,...,n  are  the  corresponding  points 
lying  on  the  solid  line  in  the  figure.  We  will  on  occasion  refer  to  p^c 


as  the  lower  hull  of  p^.  Then  from  statement  4a  we  can  conclude  that 
PiC=ai+**,+ai*  But  this  is  in  fact  the  definition  of  q^.  Therefore  we 
have  proved  pj_c=q^.  Define  r^c  the  lower  bound  of  the  convex  hull  of 
r^.  Then  Lemma  3.6  and  statement  4b  imply  that  r^c=p^c.  Using  Lemma 
3.5  and  statement  4a  we  then  have  that  aj=b£,  for  i=l,...,n. 

Corollary  If  A(e)  is  in  explicit  form,  then  4a  implies  4b  in  Theorem 
3.5,  so  that  4a  by  itself  is  equivalent  to  MSSNS. 

Proof;  Since  A(e)  satisfying  the  MSSNS  condition  implies  that  its 
explicit  form  A^(e)  satisfies  the  same  condition  and  vice  versa,  we 
shall  consider  A-^e)  in  the  proof.  Let 


A^e) 


A11 

A12  ••• 

....  A-j^ 

eklh21 

Jtl« 

e  a22  • • • 

. .  eklA2n 

«ta-1«n2  - 

Then  from  4a  we  know 


l 

Z 


Z 

j=l J 


“  Pi' 


i=nQ,  n0+nl , . . . ,  n0+ . . .  +%_! ,  N 


because  an0-an0+1,  an0+ril“an0+nl+1,.... 


That  is,  the  order  of  the  gcd 


of  all  n0xn0  principal  minors  is  0.  The  order  of  the  gcd  of  all 
(nQ+n^)x(nQ+n^)  principal  minors  is  n^i^  and  so  on.  It  is  easy  to  see 
that  fno^no*0  since  A^  is  the  only  principal  minor  of  order  nQ  whose 
order  is  0.  Similarly,  the  submatrix  containing  A^,  A12'  ei^A21' 
eilA22  the  oniy  (ng+n^) xfnQ+n^  principal  submatrix  whose 
determinant  can  be  equal  to  n^k-^.  Thus  rnO+nl=:PnO+nl*  Continune  this 
argument  we  can  finally  prove  this  corollary. 


Note: 

Here  is  an  example  to  show  that  in  general  condition  4a  alone  is 
not  sufficient  to  ensure  the  MSSNS  condition.  Let 


A{e)  = 


1  -1 
1  -l+e?. 


Here 

Pi  «  0,2 

^i  —  0,2 


So  that  Pi  and  ai  satisfy  F1-F3.  But  A(0)  does  not  have  SS NS  as  is 
easily  seen  by  checking  that  both  its  eigenvalues  are  0  but  A(0)=0. 
This  is  due  to  the  fact  that  2b  is  violated:  ^=2^=0  while 

From  the  proof  of  Theorem  3.1,  especially  (3.3),  we  see  that  if 
A(c)  satisfies  MSSNS,  then  for  any5>0,  there  is  an  e0,  such  that  if 


0<e<e*  then 


or 


or 


I  Am  •  +i/e*  -Ai(Aii)|<d,  i=l,...,n-j 
3  j=0,...,n-l 


l>M-+i/ck  -  \(Ajj)  I  =  0(e) ,  i=l,...,ru 

3  j =0 (•••/ n-1 


1  *m-  +i  "  xi<ek  Ajj)  I  =0(ek5+1), 

3  j  ^0  ,  «  • » ,  n-1 


Therefore  we  have  the  following  theorem. 

Theorem  2*2: 

If  A(e)  has  MSSNS,  the  eigenvalues  of  A(e)  are  clustered  in  n 
groups,  with  those  in  the  k-th  group  lying  within  0(ekj+1)  of  the 
eigenvalues  of  e^Ajj. 

In  Section  3.4,  we  use  the  results  of  this  section  to  define  an 
algorithm  to  determine  the  invariant  factors  of  A(e)  frcm  the  gcd's  of 
its  principal  minors  under  the  condition  that  A(e)  satisfies  the  MSSNS 
condition.  Before  doing  that,  let  us  first  consider  some  equivalent 
conditions  for  MSST. 


3.3  Equivalent  statements  for  MSST  condition 


Having  seen  the  equivalent  conditions  for  MSSNS,  one  would  expect 
to  see  the  similar  conditions  for  MSST.  As  we  will  see  however,  there 
is  one  additional  property  of  the  eigenvalues  that  is  required  for  MSST, 


and  this  allows  us  to  gain  a  deeper  understanding  of  the  MSST  condition. 

Theorem  3.3: 

The  following  statements  are  equivalent. 

1.  A(e)  has  MSST. 

2.  The  A^,  i=l,...,n  are  Hurwitz. 

3.  The  system  x=A(e) x  has  well-behaved  time-scale  structure. 

4.  A(e)  has  MSSNS  and  0[Re(Ai)J  <  0[Im(Ai)]  for  i=l,...,n. 

where  the  A^  are  the  eigenvalues  of  A(e)  and  the 
OtftetAjj]  and  OlImCA^]  are  the  orders  of  the  real 
and  imaginary  parts  of  the  eigenvalues  of  A(e) 
respectively. 

The  first  three  statements  were  proven  in  the  last  chapter.  Here 
we  only  need  to  prove  2  <=>  4.  First  let  us  prove 
4  ===>  2. 

Since  A(e)  has  MSSNS,  from  theorem  3.1  we  know  that  the  eigenvalues 
of  A{e)  are  clustered  in  n  groups.  Denote  the  eigenvalues  of  A(e)  by 

An,  » *21'  ••••  '*kl'  *kn 


,....,  Ann  ,  where,  from 


«_  \  *.  %  %  •  I 


Theorem  3.2  and  the  MSSNS  assumption 

=  A^(e^^Ajj)  +  0(e^+^-) ,  i=l,...,nj 

j=l, • • • t n 

Since  MSSNS  implies  that  Ajj  is  invertible,  we  see  that 

limtAji/e^)  =  H  0  (3.11) 

Furthermore,  since  we  have  assumed  the  0[Re(Aki)]  _<  Oflm^*)],  we  can 
immediately  conclude  that 

cj;L  =  lim(Re(Aj i)/ek3)  =  Re(A.(Ajj))  \  o 

Since  A(e)  is  Hurwitz  for  €f(0,e0)  we  have  that  Cji<0,  which  in  turn 
implies  that  Ajj  is  Hurwitz. 

2  ==>  4 

We  have  already  seen  that  2  ==>  that  A(e)  has  MSSNS.  Since  (3.11) 
still  holds  and  Re  (Ajj)]  <  0  by  assumption,  we  immediately  have  that 
0[Re(Aji)]  _<  o[Im(Aji], 


As  commented  in  Section  3.1,  the  fourth  statement  in  this  theorem 
states  that  if  the  system  has  MSSNS  and  if  in  addition  the  damping  rate 
is  at  least  as  fast  as  the  oscillation  rate  (i.e.  the  orders  of  the 
real  parts  of  the  eigenvalues  are  equal  to  or  less  than  the  orders  of 
the  corresponding  imaginary  parts)  then  the  system  has  well-behaved 
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time-scales.  The  reverse  is  also  true. 

Thus,  in  this  section  we  have  proven  a  result  which  relates  the 
MSST  condition  with  several  other  conditions.  In  the  following  section 
we  use  the  results  of  Section  3.2  to  define  an  algorithm  to  determine 
the  invariant  factors  of  A(e)  from  the  gcd's  of  its  principal  minors 
under  the  condition  that  A{e)  satisfies  the  MSSNS  condition. 


3.4  An  Algorithm  For  Determing  The  Invariant  Factors  Of  A(e) 

Given  certain  real  numbers  p^r  i=lf...,n,  there  is  a  simple 
algorithm  to  determine  a^  which  satisfy  F1-F3  in  Lemma  3.3.  The 
algorithm  proceeds  as  follows.  The  first  objective  is  to  find  the  slope 
of  the  initial  line  segment  and  the  first  "turning  point".  We  do  this 
by  computing  for  each  p^  the  slope  Uj=b^/i  of  the  line  from  the  origin 
to  the  point  p^.  Then  the  initial  slope  is  the  smallest  of  the  u^  and 
the  first  turning  point  corresponds  to  the  largest  value  of  i  that  has 
this  minimum  value  of  u^.  From  this  point  we  essentially  repeat  the 
process  by  looking  at  the  slopes  of  lines  from  this  turning  point  to 
each  of  the  p^  corresponding  to  subsequent  values  of  i.  A  flow  graph  of 
an  alg^  rithm  that  accomplishes  this  is  depicted  in  Figure  3.3  .  As 
indicated  in  this  flow  graph,  each  subsequent  slope  and  turning  point 
computation  can  be  reduced  to  the  original  one  by  shifting  the  previous 
turning  point  to  the  origin. 


The  algorithm,  illustrated  in  Figure  3.4.1,  proceeds  as  follows 
1.  1=6 

2.  qj=  3,  1,  2/3,  3/4,  4/5,  1 

3.  It  is  easy  to  check  that  m=3,  <^=2/3. 

4.  a^ =a2=32=^/3 • 


5.  rrpn=6.  therefore  continue 


6.  l=6-3=3,  Pj_=lf 2,4. 

7.  Repeat  1-4.  We  have  a4=a5=l. 

8.  This  time,  1=1,  p^=2=ag.  Stop. 

So  we  have  aA  =  2/3,  2/3,  2/3,  1,  1,  2. 

The  next  two  examples  show  how  to  actually  compute  the  invariant  factors 
from  a  matrix  satisfying  the  MSSNS  condition. 


Example  3.4.2 


A(e) 


1 

c 

1 

c 


c  - 

1  e2 
1  e 
cP  - 


The  order  of  the  gcd  of  Mi1  is  obviously  0,  because  there  are  diagonal 
elements  of  A(e)  that  have  order  0.  Since  one  of  the  2x2  principal 
minor  is  1,  we  also  have  that  P2=0.  We  then  find  that  pj  is  determined 
by  the  upper  left  3x3  minor  which  is  e.  Also  det(A(e))=e®.  So  that 

Pl=P2=0f  £>3=1,  p4=5. 

If  we  also  know  that  A(e)  satisfies  MSSNS  condition  (this  is  true  for 
this  example),  then  using  our  algorithm  we  have 


al=a2=t)l=b2=0 
a3=b3=1'  a4=b4=5. 


This  algorithm  in  fact  also  provides  a  way  to  check  the  MSSNS  condition. 
We  first  observe  that  r^=p^.  Then,  since  there  is  a  3x3  principal  minor 
whose  order  is  1  and  there  is  no  other  3x3  minor  of  lower  order,  we  know 
that  the  invariant  factors  are  0,0,1  and  5.  The  numbers  determined  by 
p^  according  to  the  algorithm  are  0,0,1  and  5  which  are  equal  to  the 
orders  of  the  invariant  factors.  Therefore  statement  4  of  Theorem  3.1 
tells  us  that  A(e)  has  MSSNS. 

Example  3.4.3 

e  1  1  1 

1  e7  e  e 

e2  e3  0  c3 

e6  ^1  ell- 

It  can  be  seen  that 

Pl=l,  P2=°»  P3=3,  p4=14 . 

As  in  the  preceding  example,  we  can  check  that  this  A(e)  satisfies 
MSSNS,  and  our  algorithm  then  yields  the  invariant  factors: 


ai=a2=0f  St =3 ,  a4=ll 


3 . 5  Conclusion 


The  new  results  in  this  Chapter  are  the  following. 

1.  New  equivalent  conditions  for  MSSNS  —  1)  the  orders  of 
the  eigenvalues  being  equal  to  those  of  the  invariant 
factors.  2)  A(e)  satisfies  the  no-cancellation 
condition  and  the  uniquely  determine  a^  (Theorem 
3.1). 

2.  A  new  equivalent  condition  for  MSST  —  A(e)  satisfies 
MSSNS  and  in  addition  the  orders  of  the  real  parts  of 
the  eigenvalues  are  equal  to  or  less  than  those  of  the 
corresponding  imaginary  parts  (Theorem  3.3). 

3.  If  A(e)  has  MSSNS,  then  the  eigenvalues  of  A(e)  are 
clustered  in  n  groups  (Theorem  3.2). 

Based  on  these  new  results  one  can  derive  several  conclusions. 

1.  The  new  condition  for  MSSNS  does  not  require  us  to  compute  the 
successive  Schur  complements  to  determine  if  a  system  satisfies  this 
condition.  Instead,  if  the  orders  of  the  eigenvalues  and  invariant 
factors  have  been  computed,  then  we  can  easily  see  if  A(e)  has  MSSNS  as 
pointed  out  in  Theorem  3.1,  statement  3.  This  result  is  in  fact  the 
basis  for  our  approach  to  scaling  developed  in  the  next  chapter.  This 
scaling  involves  the  construction  of  a  non-unimodular  similarity 


transformation  so  that  the  orders  of  the  invariant  factors  can  be 
changed  to  be  eaual  to  the  orders  of  the  eigenvalues  (which  of  course 
are  left  unchanged).  If  in  addition  the  real  and  imaginary  parts  of  the 
eigenvalues  of  A{e)  satisfy  the  order  condition  as  stated  in  Theorem 
3.3,  then  after  scaling  the  new  system  matrix  will  not  only  have  MSSNS 
but  also  MSST.  So  that,  as  stated  in  Section  2.5  and  2. 6, the  system 
will  have  well-behaved  time-scales  and  the  results  in  those  sections  can 
apply.  If  this  condition  is  not  satisfied,  then  using  the  results  in 
Chapter  2,  Section  2.8,  after  scaling  we  can  still  have  the  extended 
well-defined  time  scale  behavior. 

The  second  approach  for  checking  the  MSSNS  condition  is  to  use 
statement  4  in  Theorem  3.1.  Namely,  we  first  check  if  r^p^,  when 
an,Ham+i,(the  no-cancellation  condition).  If  the  answer  is  yes  then  we 
conclude  that  the  system  does  not  have  MSSNS.  If,  on  the  other  hand, 
they  are  equal,  we  proceed  to  check  if  the  gcd's  of  the  principal  minors 
and  the  invariant  factors  satisfy  F1-F3.  The  third  method  is  to  make 
use  of  the  corollary  of  Theorem  3.1.  That  is,  we  first  obtain  the 
explicit  form  of  A(e).  Then  compute  the  p^  from  the  explicit  form  and 
check  if  they  and  a^  satisfy  F1-F3. 

2.  The  new  condition  for  MSST  is  not  only  a  new  criterian  for  MSST, 
it  is  also  a  bridge  between  MSSNS  and  MSST.  In  other  words,  if  A(e)  has 
MSSNS  then  we  should  only  check  the  orders  of  the  real  and  imaginary 
parts  of  its  eigenvalues  to  make  sure  if  it  has  MSST. 


3.  Theorem  3.2  provides  a  way  to  check  the  approximate  locations 
of  the  eigenvalues  if  A(e)  satisfies  MSSNS.  In  principal,  if  A(e)  does 
not  satisfy  MSSNS,  but  we  can  find  a  scaling  to  bring  it  to  MSSNS,  then 
this  theorem  can  still  apply  to  the  transformed  system.  It  holds 
regardless  of  the  stability  of  the  system.  It  also  suggests  a 
connection  to  frequency  domain  consideration.  That  is,  even  if  A(e) 
does  not  satisfies  the  stability  condition  of  Theorem  3.3,  as  long  as  it 
satisfies  the  MSSNS  condition,  we  should  expect,  in  view  of  its 
clustered  eigenvalues,  well-defined  multiple  frequency-scales  (cf. 
[20],  [21]).  In  other  words,  the  frequency  response  of  the  original  system 
should  be  able  to  be  approximated  by  the  frequency  response  of  a 
decoulpled  frequency-scale  separated  system  as  the  one  suggested  in 
Chapter  2,  Theorem  2.1. 

This  Chapter  has  dealt  to  a  great  extent  with  the  relationship 
between  time  scales,  invariant  factors,  and  eigenvalues.  A  natural 
question  to  ask  is  the  relationship  between  time  scales  and  the  complete 
eigenstructure  of  A(e).  In  particular,  let  us  point  out  one  result  on 
the  relationship  between  the  Jordan  form  of  A(e)  and  the  MSSNS 
condition.  Let 

A(e)  =  M(e)A(e)M_1(€) 

where  A(e)  is  the  Jordan  form  of  A(e)  and  M(c)  is  normalized  (i.e.  the 
eigenvectors  are  chosen  to  have  the  unit  length).  We  then  have  the 
following  reault.  Suppose  that  M(e)  is  unimodular.  Then  A(e)  has  MSSNS 


if  and  only  if  A(0)  has  SSNS  (i.e.  if  it  contains  no  diagonal  blocks  of 
dimension  greater  than  1  with  0's  on  the  diagonal).  This  follows  from 
the  observations  that  (1)  A(e)  has  MSSNS  if  and  only  if  A(0)  has  SSNS 
and  (2)  M(e)  is  unimodular.  Let  us  give  some  examples  and  make  several 
comments  about  this  result.  Consider  the  matrix 

e 

A(e)  = 

0 

In  this  case  M(e)=I  and  A(0)=0,  from  which  we  can  deduce  that  A(e)  has 
MSSNS.  On  the  other  hand,  consider 

A(e)  = 

In  this  case  M(e)=I  but  A(0)  does  not  have  MSSNS. 

It  is  important  to  point  out  that  there  are  many  cases  in  which 
M(e)  is  not  unimodular.  In  this  case  it  is  perfectly  possible  for  A(e) 
to  have  MSSNS  even  if  A(0)  does  not  have  SSNS  or  for  A(e)  not  to  have 
MSSNS  even  if  A(0)  does  have  SSNS.  Let  us  illustrate  this  with  two 
examples. 


CHAPTER  4 


SCALING 


4.1  INTRODUCTION 

4.1.1  Motivation 

Having  seen  the  decisive  role  that  MSSNS  and  MSST  playr  we  now 
consider  systems  that  do  not  have  MSSNS.  In  this  chapter  we  show  that 
under  specified  conditions  a  scaling  transformation  on  the  states 
produces  a  system  that  has  MSSNS,  so  that  the  formulation  in  previous 
chapters  can  apply. 

We  have  already  seen  in  Chapter  3  that,  when  the  system  does  not 
have  MSSNS,  the  orders  of  eigenvalues  and  invariant  factors  are  not 
equal,  and  conversely.  The  only  way  to  make  the  system  have  MSSNS  is 
therefore  to  change  the  orders  of  its  eigenvalues  and/or  those  of  its 
invariant  factors.  We  choose  the  latter,  since  this  will  not  change  the 
system’s  dynamics  and  is  more  direct  to  implement.  In  fact,  a 
similarity  transformation  through  a  non-unimodular  matrix  has  the 
potential  to  meet  this  requirement,  since  it  will  not  change  the 


eigenvalues  but  can  modify  the  invariant  factors. 


This  can  be  clarified  by  a  simple  example  that  we  have  inspected 
before,  namely 


» 

x 


-e  1 
0  — € 


X 


As  mentioned  in  Chapter  3,  Section  3.1,  this  system  does  not  have  MSSNS 
since  the  orders  {b^}  of  its  eigenvalues  are  1,1  but  the  orders  {a^}  of 
its  invariant  factors  are  0,2.  This  failure  to  satisfy  the  MSSNS 
condition  could  also  be  deduced  from  Fig.  4.1.  From  this  figure,  we 
see  that  the  slopes  of  the  lower  bound  of  the  {p^}  are  1  and  1.  Recall 
that  points  on  the  lower  bound  of  the  {p^}  were  defined  as  p^c  in  the 
last  chapter.  Recall  also  that  we  defined  rA  as  the  order  of  the  sum  of 
all  ixi  principal  minors,  and  as  points  on  their  lower  bound.  Since 
in  this  example  r^^p^0,  these  slopes  are  also  the  {bjj.  Also,  the 
invariant  factor  orders  are  0  and  2,  corresponding  to  the  slopes  of  the 
dotted  line  segments  making  up  the  plot  of  versus  i,  where  we  recall 
that  was  defined  as  the  gcd  of  all  ixi  minors. 


Figure  4.1  A  simple  example 


Absence  of  MSSNS  inplies  that  the  system  does  not  have  well-behaved  time 
scales.  For  this  example,  as  we  have  seen  in  Section  3.1,  x-^t)  becomes 
unbounded  at  the  time  scale  et  as  e  goes  to  zero. 

However,  if  we  apply  the  following  linear  transformation 


which  is  simply  a  (diagonal)  scaling  of  state  variables,  we  obtain  the 
system 


For  this  new  system,  it  can  be  seen  that  the  orders  of  eigenvalues  and 
invariant  factors  are  the  same,  so  the  system  satisfies  MSSNS.  It  is 
furthermore  evident  that  the  system  satisfies  MSST  as  well,  so  results 
we  obtained  in  previous  chapters  apply  to  it.  Returning  to  Fig.  4.1.1, 
we  note  that  the  orders  {p^}  of  the  gcd's  of  the  principal  minors  are 
unchanged  by  diagonal  scaling,  but  the  orders  {q^}  of  the  gcd's  of  all 
minors  have  been  raised  by  scaling,  so  that  the  two  curves  now  coincide. 
The  solution  of  the  scaled  system  is  given  by: 

Yl(t)  ■  exx(t)  =  ex10e-€t  +  x20ete-et 

y2(t)  =  x2(t)  =  x20e-et  (4.1) 

Substitute  t  for  et  to  get 

y l U)  =  cx10e_c  +  x20te“^ 
y2(t)  *  x20e“c 

Obviously  this  scaling  has  served  to  avoid  the  problem  of  the  unsealed 
system,  in  that  the  result  remains  bounded  for  time  scales  c  and  et  as  £ 
goes  to  0. 

Ihe  significance  of  scaling  is  two-fold.  First,  as  we  have  seen  in 
Chapter  3,  Theorem  3.2,  if  the  system  matrix  satisfies  the  MSSNS 
condition,  then  its  eigenvalues  can  be  approximated  by  the  eigenvalues 
of  its  successive  Schur  complements.  Hence,  if  A(e)  does  not  have  MSSNS 


but  we  can  find  a  scaling  so  that  the  resulting  matrix  does,  then  we  can 
still  obtain  a  good  approximation  to  its  eigenvalues  via  reduced-order 
computations,  because  scaling  preserves  eigenvalues.  Second,  since  the 
scaled  system  has  MSSNS,  the  procedure  described  in  the  previous  chapter 
will  provide  a  time  scale  decomposed  system  to  approximate  it,  provided 
MSST  is  also  satisfied.  We  conjecture  that  the  solutions  of  this  system 
can  provide  a  good  approximation  to  those  of  the  original  system,  in  a 
sense  that  will  be  discussed  at  the  conclusion  of  this  chapter. 


4.1.2  Summary  of  the  Scaling  Procedure 

This  chapter  concentrates  on  the  development  of  a  scaling  procedure 
that  can  be  applied  to  a  large  class  of  systems.  The  procedure  involves 
the  following  steps.  The  first  step  is  to  transform  A(e)  to  its 
explicit  form,  Ae(e)  =  P“^(e) A(e)P(e)  =  D(e)£(e).  This  step  does  not 
change  either  the  invariant  factors  or  the  eigenvalues  of  A(e). 

The  second  step  involves  identifying  what  we  term  a  skeleton  in  the 
explicit  form.  A  skeleton  consists  of  n  entries  of  P^{e),  precisely  one 
from  each  row  and  column/  with  orders  equal  to  the  orders  of  the  rows  in 
which  they  are  located.  By  the  order  of  the  i-th  row,  we  mean  the  order 
ai  Of  the  invariant  factor  of  this  row  of  the  explicit  form  matrix 
AgCe).  Each  skeleton  element  therefore  has  minimal  order  for  its  row. 
Since  A(e)  has  full  rank,  it  is  easy  to  see  that  there  is  at  least  one 
skeleton  in  Ag(e). 

Now  identify  the  skeleton  above  with  the  nxn  permutation  matrix 
that  has  l's  at  the  locations  of  the  skeleton  elements  and  0's 
elsewhere.  It  is  really  only  this  pattern  of  the  skeleton's  placement 
within  Ag(e)  that  we  shall  be  using  in  what  follows,  so  we  shall  usually 
talk  of  the  skeleton  as  if  it  is  the  permutation.  Any  permutation  can 
be  uniquely  expressed  as  a  product  of  disjoint  cycles  (see  subsection 
4.2.1).  It  follows  from  this  that,  perhaps  after  some  re-ordering  of 
the  variables  associated  with  our  system  (which  corresponds  to 
similarity  transformation  by  a  symmetric  permutation),  the  elements  of 


the  skeleton  can  be  brought  to  the  positions  occupied  by  the  l's  in  a 


block  diagonal  canonical  circulant  matrix  (BDCM).  whose  diagonal  blocks 
take  the  form: 


f 

This  re-ordering  of  variables  is  the 
third  step  of  our  procedure.  Let  a^  denote  the  order  of  the  skeleton 
element  in  the  i-th  row,  following  the  re-ordering  of  variables.  The 
set  {ajJ  is  the  same,  therefore,  .as  the  set  {a1<a2i-..<aN}  of  invariant 
factor  orders. 

For  the  moment,  let  us  assume  that  there  is  only  one  block  in  the 
BDCM,  so  as  to  simplify  our  introduction.  'Then,  under  some  assumptions 
on  these  integers  a^  and  on  the  principal  minors  of  Ag(e),  the  following 
scaling  can  be  shown  to  transform  the  matrix  to  one  that  satisfies 
MSSNS: 


C  = 


0  1 


or  simply  [1]  for  a  scalar  block. 


S(e)  =  diag{eSl,  eS2,  ...  ,  e^1,  1},  (4.2) 

where 

si  ®  si+l  +  ^i  ”  ®i»  ®n  =  ® 

and  the  b^  are,  as  before,  the  orders  of  the  eigenvalues.  The  fourth 
(and  final)  step  of  our  scaling  procedure  is  thus  the  application  of 


this  diagonal  scaling  transformation  to  the  system 

In  summary,  the  procedure  for  finding  a  diagonal  scaling  that 
induces  MSSNS  is 

1.  Transform  A(e)  to  its  explicit  form  Ag(c). 

2.  Identify  a  skeleton  in  Ag(c). 

3.  Apply  a  symmetric  permutation  T  to  Ag(e)  so  that  the 
skeleton  of  A^e)  =  TAg(e) T-1  corresponds  to  a  BDCM. 

4.  Calculate  S(c)  *  diag{csJ-,...esN}  for  A^e)  so  that 
A2(e)  =  S(e)A1(c)S"1(e)  =  S(c)TAe(s)T1S”1(e) 

has  MSSNS. 

A  precise  statement  of  sufficient  conditions  for  this  procedure  to 
succeed,  as  well  as  the  details  of  Step  4  for  a  system  with  a  multiple- 
block  skeleton,  are  given  in  Section  4.2,  along  with  proofs  of  the  main 
assertions. 

Note  1  We  shall  suppose  in  the  remainder  of  this  chapter  (unless 
otherwise  specified)  that  A(e)  is  already  in  its  explicit  form,  i.e. 
that  step  1  has  already  been  carried  out. 

Note  2  Rather  than  using  the  symbols  {a ^ }  for  the  reordered  set 
{a^},  we  shall  from  now  on  in  this  chapter  use  {a^}  as  generic  symbols 
for  invariant  factors,  with  ordering  determined  by  the  context. 

Note  2  In  the  next  section  we  shall  state  several  conditions  that 
are  together  sufficient  for  our  procedure  to  work.  Before  doing  this. 


we  first  note  an  important  necessary  condition  for  our  scaling  procedure 
to  succeed.  From  the  description  of  the  procedure  above,  we  see  that 
only  symmetric  permutations  and  diagonal  scaling  will  be  applied  to  the 
explicit  form  Ae(e).  Obviously  these  operations  have  no  effect  on 
principal  minors.  However,  from  Theorem  3.1  we  also  know  the  condition 
that  pj0  =ric,  i=l,...N,  i.e.  that  the  {pj_}  uniquely  determine  the  {bjj, 
is  a  necessary  condition  for  Ae(e)  to  have  MSSNS.  (This  condition  was 
called  the  "no  cancellation"  condition  in  the  previous  chapter.) 
therefore,  if  after  scaling  we  have  the  MSSNS  condition,  so  that  the  no 
cancellation  condition  holds,  it  must  hold  for  the  original  matrix.  We 
shall  take  this  necessary  no-cancellation  condition  as  a  standing 
assumption  throughout  this  chapter. 


4.1.3  Outline  of  this  chapter 

Section  4.2  is  devoted  to  filling  out  the  outline  above.  In 
Section  4.2.1  we  briefly  discuss  some  properties  of  permutations; 
further  details  can  be  found  in  Appendix  4A.  Then  in  Section  4.2.2  we 
introduce  the  essential  idea  of  our  approach  through  a  key  example. 
Section  4.2.3  deals  with  diagonal  scaling  for  the  simplest  case  where 
there  is  only  one  cycle  in  the  skeleton.  Section  4.2.4  extends  this 
result  to  the  more  general  case  where  the  skeleton  has  several  cycles. 


Since  our  scaling  procedure  is  derived  under  some  assumptions,  it 


is  important  to  know  that  there  are  interesting  problems  where  these 
assumptions  are  indeed  satisfied.  We  describe,  in  Section  4.3,  the 
application  of  our  scaling  procedure  to  high-gain  feedback  problems.  In 
particular,  we  show  that  our  procedure  leads  to  scalings  used  by  Sannuti 
in  [14].  (This  paper,  along  with  [15],  motivated  our  study  of  scaling 


in  the  first  place.) 


4.2  Diagonal  Scaling 


4.2.1  Introduction 

In  this  section,  we  examine  matrices  A(e)  with  structural  features 
that  cause  them  to  not  have  MSSNS.  We  show  that  it  is  possible  (under 
certain  conditions)  to  choose  a  scaling  of  such  a  matrix  that  induces 
MSSNS. 

Without  loss  of  generality,  it  is  assumed  that  A(e)  is  in  its 
explicit  form  (see  Note  1  of  Section  4.1).  As  mentioned  in  that 
section,  we  shall  be  considering  transformations  of  the  form  S(e)T, 
where  S(e)  is  a  diagonal  scaling  matrix,  T  is  a  permutation  matrix,  and 

S(e)  =  S(e)TA(e) faster1  -  S(e) A(e)3(e)_1  (4.3) 

Note  that  A(e)  and  A(e)  have  the  same  eigenvalues  but  may  have  different 
invariant  factors  if  S(e)  is  not  unimodular.  We  shall  show  how  S(e)  can 
be  chosen  so  that  the  invariant  factors  and  eigenvalues  of  £(e)  have  the 
same  orders,  even  when  A(e)  does  not  possess  this  property. 

Hie  development  of  our  scaling  procedure  involves  identifying  and 
manipulating  permutation  matrices  that  reflect  important  structural 
features  of  A(e).  Before  we  begin  this  development,  therefore,  let  us 
first  review  some  properties  of  permutation  matrices,  i.e.  of  matrices 


obtained  by  permuting  the  rows  and/or  columns  of  the  identity  matrix  I, 


A  permutation  matrix,  say 


P  = 


"0 

0 

0 

1 

0 

Lo 


o 

0 

1 

0 

0 

0 


0 

0 

0 

0 

0 

1 


1 

0 

0 

0 

0 

0 


0  o' 
0  1 
0  0 
0  0 
1  0 
0  0. 


can  also  be  represented  in  terms  of  the  cycles  that  comprise  it.  For 
the  example  above,  the  decomposition  of  P  into  cycles  is  given  by 

6p  =  (1,4) (2,3,6) (5) 

This  notation  serves  to  indicate  that  if  P  is  applied  to  a  vector 
x=(x1,x2,~.,x6),  then  the  following  cycling  of  elements  occurs: 

[X-L-- >X4,  X4— >X1],  [X2— >X3,  X3~>X6,  X6~>X2],  [X5— >X5].  HereOp 
consists  of  three  cycles,  which  are  uniquely  defined. 

The  canonical  circulant  matrix  is  a  permutation  of  the  form 

=  (n,n-l,  n  2 ,  ...  ,  2,  1) 

This  corresponds  to  the  permutation  or  circulant  matrix 

ro  1  0 . 01 

0  0  1 . 0 

0  0  0  . 0 


0  0  0  . 1 

Li  o  o . oJ 


A  block  diagonal  canonical  circulant  matrix  (BDCM)  is  defined  as 
C  =  block  diagonal  {Cp  C2,...,  C^} 
where  Cp  i=l,...,  m  are  canonical  circulant  matrices. 

It  can  be  shown  (see  Appendix  4A)  that  any  permutation  matrix  can 
be  brought  to  a  BDCM  by  a  symmetric  similarity  transformation  T  that  is 
itself  a  permutation  matrix: 

P  =  TPT-1,  P  is  BDCM  form. 

We  shall  call  this  transformation  T  a  symmetric  permutation.  Each  block 
of  P  corresponds  to  one  cycle  of  9p. 

In  the  next  subsection  we  introduce  our  approach  to  scaling  through 
an  example  that  also  serves  to  illustrate  the  role  of  circulant  matrices 
in  our  development. 


4.2.2  A  Key  Example 

The  essential  ideas  underlying  the  scaling  approach  proposed  in 
this  section  are  exposed  by  the  following  key  example.  To  proceed,  let 
us  first  recall  some  definitions  from  the  previous  chapter: 

a^  =  orders  of  invariant  factors,  i=l,...,N 


b^  =  orders  of  eigenvalues. 


i=l , » • • , N 


i 

Z 

and,  based  on  the  no  cancellation  assumption,  the  points  on  the  lower 
bound  of  the  {p^}  are  given  by 

Pic  =  Cic  =Z  bj, 

j-1  J 

Now,  suppose  A(e)  has  the  following  special  form: 


'0  e*1  0  0 . 0 

0  0  e*2  0 . 0 

A(e)  =  .  (4.4) 

0  0  0  0  . e3-11"1 

e  n  0  0  0 . 0 


Note  that  A  becomes  the  canonical  circulant  matrix  if  nonzero  entries 
are  replaced  by  1;  we  shall  therefore  say  A  has  the  structure  of  a 
circulant  matrix.  It  is  easy  to  see  that  A  is  in  explicit  form,  and 
that  the  orders  of  its  invariant  factors  are  a^,  i=l,...,n.  (Recall 
from  Note  2  of  the  previous  section  that  we  are  still  using  { a^ }  to 
denote  the  orders  of  the  invariant  factors,  but  they  may  not  be  ordered 
as  in  earlier  chapters.)  Since  the  ixi  pricipal  minors  with  i<n  are  all 
zero  (this  implies  rj=Pi=  »  i=l,...,n-l)  and  rn=pn=qn=ai+...+an,  we  can 


draw  the  following  picture: 


It  is  easy  to  see  from  this  picture  that  the  orders  of  the  eigenvalues 
bif  which  are  determined  by  r^^p^0,  are  equal  to  b=qn/n  for  all  i. 
This  can  also  be  obtained  by  checking  the  characteristic  polynomial  of 
A,  which  is 

An  -  cV 

where  v=qn.  This  again  shows  that  all  of  the  b^  are  equal,  namely 
b^=b=qn/n  for  i=l,...,n.  Thus  A  will  not  have  MSSNS  unless  all  of  the 
aj,  are  the  same. 

Now  let 

S  =  diag{eSl,  e^2,  e5n-1,  1} 


E 


Then  after  scaling 


0  e5  0  0 
0  0  fP  0 


£  »  SAS"1  = 


0  0  0  0 

€*>0  0  0 


and  the  invariant  factors  of  £  have  the  same  orders  as  its  eigenvalues. 

If  A  does  not  have  the  structure  of  a  circulant  matrix,  but  of  a 
more  general  permutation  matrix,  then  we  can  apply  a  symmetric 
permutation  to  bring  it  to  block  circulant  structure.  For  example, 
consider 


"0  0  e31' 

A  =  e33  0  0 

0  e33  0 


As  with  the  last  example,  A  is  in  explicit  form  and  the  orders  of  the 
invariant  factors  are  ap  a2  and  a3.  The  orders  of  the  eigenvalues  are 
all  equal  to  b=(a1+a2-*-a3)/3.  Choosing  the  symmetric  permutation  matrix 


we  obtain 

0  e32  0 

0  0  e31 

e33  0  0 

To  bring  A-]_  to  MSSNS,  we  can  choose 
S  =  diagie31,  6s3,  1} 

where  s1=2b-a1-a2,  s2=b-a1.  Applying  S  to  Ap  we  obtain 


Ax  =  TAT-1  = 


'0 

eS2 

0  “ 

I 

o 

p. 

o 

STAT~1S“1  = 

0 

0 

A 

e_s2  0  0 

.0 

0 

1  _ 

.0  0  1  , 

=  SAS-1  = 


0  e5  0 

0  0  e5 

e5  0  0 


In  general,  the  entries  of  A  that  were  zero  in  this  example  will 
not  be  zero.  Nevertheless,  the  essential  idea  can  still  be  carried  over 
to  the  more  general  situation,  under  some  conditions.  In  the  next 
subsection  we  start  with  the  special  case  where  A  has  the  form  of 


(4.4) ,  except  that  the  zero  entries  of  (4.4)  might  no  longer  be  zero 
(though  they  are  still,  in  some  sense,  subsidiary).  In  other  words,  we 
shall  assume  that  there  is  a  skeleton  in  A,  as  defined  in  Section  4.1, 
whose  elements  correspond  to  the  l's  in  a  circulant  matix  with  one 
cycle.  We  move  to  the  more  general  case,  where  the  skeleton  elements 
form  a  permutation  matrix  with  several  cycles,  in  the  subsection  after 


the  next  one. 


4.2.3  Scaling  for  matrices  with  one-cycle  skeleton 


We  shall  now  prove  that  the  scaling  procedure  described  in  Section 
4.1.2  works  for  the  simplest  case,  where  a  skeleton  of  A(e)  (in  its 
explicit  form,  as  assumed  in  Note  1  of  Section  4.2.1)  has  only  one 
cycle.  We  shall  then  move  to  the  more  general  case  where  all  skeletons 
in  A(c)  has  multiple  cycles.  Recall  that  the  skeleton  of  A(e)  having 
one  cycle  means  that,  after  a  symmetric  permutation,  the  skeleton  will 
be  in  BDCM  form  with  only  one  block,  while  multiple  cycles  implies  that 
the  BDCM  form  will  have  multiple  blocks. 

Before  stating  the  three  assumptions  which  are  retained  throughout 
this  and  the  next  subsections,  we  state  some  definitions  that  we  need. 

The  dimension  of  a  principal  sufcmatrix  P  of  matrix  A  is  the  number 
of  rows  in  P.  The  order  of  a  principal  submatrix  is  the  order  of  its 
determinant,  i.e.  the  order  of  the  associated  principal  minor.  A 
diagonal  of  a  principal  submatrix  of  dimension  n  is  defined  as  a  set  of 
n  elements  containing  precisely  one  entry  from  each  row  and  each  column 
of  P.  The  diagonal  containing  the  entries  a^-^,  am2fin3^‘**'amn,ml 
will  be  represented  as  the  ordered  set 

S  =  (mp  m2,  ....,  m^) 

There  are  n!  diagonals  in  P.  If  P  is  the  matrix  A  itself  (in  explicit 
form),  then  some  diagonals  will  be  the  skeletons  defined  before.  The 


-• W  X  TJ*  T*  _»v,pj«J  *vi  rJ1  J* 


v*  wv  .iwwjwj  v  vwi 


order  of  a  diagonal  is  defined  as  the  order  of  the  product  of  all  its 
entries.  There  is  one  special  diagonal  of  any  leading  principal 
submatrix,  namely 

Sq  s  (If  2,  ....  /  n) 

that  will  be  called  the  canonical  diagonal.  A  gantiauaua  principal 
submatrix  is  defined  as  a  principal  submatrix  with  contiguous  rows.  It 
will  be  represented  by  two  numberSf  the  idex  of  its  last  row  i  and  the 
index  of  its  first  column  j,  and  denoted  as  M[i,j],  with  a  subscript  on 
M  to  indicated  dimension  when  needed.  The  overlap  of  two  contiguous 
principal  sutmatrices  is  defined  as  the  number  of  common  rows.  It  is  an 
integer  ranging  from  0  (no  overlap)  to  m^,  the  smaller  of  the  dimensions 
of  the  two  submatrices.  Finally,  the  closure  of  a  principal  submatrix  M 
is  defined  as  the  smallest  dimension  contiguous  principal  submatrix 
containing  M. 

We  can  now  state  three  assumptions  under  which  our  scaling 
procedure  is  derived,  and  the  two  lemmas  required  to  derive  the  scaling. 


Assumption  4.1.  b^a-j,  for  i, j=l,...,N-l. 

If  we  draw  the  curves  for  p^c  and  as  in  Fig.  4.2.2,  this 
assumption  means  that  the  slope  of  p^c  is  larger  than  that  of  q^  except 
for  the  last  segment.  Note  that  this  is  not  a  necessary  condition  for 
A(e)  to  have  a  diagonal  scaling.  For  example,  consider  Eq.  (4.2.2).  We 
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have  seen  that  the  diagonal  scaling  defined  in  that  section  always  work 
no  matter  b  (the  order  of  the  eigenvalues  in  this  example)  is  smaller 
than  a^  or  not.  But  this  is  the  condition  under  which  our  algorithm  has 
been  shown  to  work. 


N-1  N 


Fig.  4.3  Pj0  and  q^  for  Assumption  1 


The  order  of  any  principal  submatrix  of  A(e)  is  equal  to  the 
smallest  order  of  its  diagonals. 


This  assumpotion  serves  to  rule  out  certain  cancellation  when 
summing  procucts  of  elements  in  different  diagonals,  ensuring  that  the 
order  of  any  principal  minor  is  not  greater  than  the  smallest  order  of 
the  diagonals  that  determine  the  principal  minor.  Again,  this  is  not  3 
necessary  condition  for  a  diagonal  scaling  to  exest. 


The  third  assumption  is  motivated  by  the  following  example. 


Consider 


112  3  4  5 

Figure  4.4 


There  are  two  principal  submatrices  and  M2  whose  orders  are  3.  In 
order  to  bring  q.^  to  p.^0,  we  ought  to  scale  three  of  the  l's,  e.g.  a12, 
a23  and  a34,  to  e  and  scale  the  remaining  two  l's,  namely  a45,  a^g,  to 
e2.  Since  the  diagonal  scaling  will  not  change  the  principal  minors,  we 
shall  end  up  with  some  entries  of  order  less  than  1,  no  matter  how  we 
arrange  the  scaling.  (After  scaling  the  order  of  the  gcd  of  all  the  lxl 
minors  should  be  equal  to  1  since  the  order  of  b^  is  1  in  this  example.) 
The  source  of  the  problem  here  is  the  overlap  of  two  principal 
submatrices  whose  orders  are  equal  to  P3C,  where  3  is  the  first  turning 
point  in  the  graph  of  Fig.4.4.  To  avoid  this  situation,  we  impose  two 
more  assumptions. 


Suppose  the  order  of  an  man  contiguous  principal  submatrix  is 
p^.  If  there  is  an  sxs  contiguous  principal  sutmatrix  M2  that  overlaps 
with  and  the  overlap  is  t  (an  integer  from  0  to  min{m,s}),  then  the 
order  of  M2  satisfies 

^m— t+l"*"^Tt— 1+2"*"  ‘  •  ,+^m-t+s+l 

Consider  the  previous  example  again.  If  A(e)  satisfied  this  assumption, 
i.e.  if  one  of  the  M^,  i=l,2  had  order  l+2+2=5  instead  of  3,  the 
difficulty  encountered  in  that  example  would  have  been  eliminated.  From 
this  assumption  we  also  can  see  that  any  contiguous  whose  order  is 
pjjjj0  must  be  the  unique  contiguous  principal  submatrix  that  has  this 
order . 

&5s.usptian  1*1: 

For  each  i=l,2,«...  the  principal  submatrix  of  A(e)  whose 
determinant  is  is  a  contiguous  principal  submatrix.  Furthermore, 
Mj^,  i=l,....,  i.e.  the  rows  of  the  are  contained  in  those  of  the 

Mi+1* 

The  following  theorem  now  states  that  the  scaling  procedure 
described  in  Section  4.1.2  does  indeed  induce  MSSNS  under  Assumptions 
4. 1-4. 4.  Since  the  steps  1-3  can  be  applied  to  any  A(e),  we  shall 
suppose  that  the  A(e)  discussed  in  Theorems  4.1  and  4.2  has  already  gone 
through  these  steps.  In  other  words,  we  suppose  that  some  skeleton  of 


A(e)  already  forms  a  BDCM  of  one  cycle. 


Define 

m^  :  turning  points  of  qjC. 

M^  :  principal  sutmatrix  whose  determinant  is  equal  to  p^c. 
f^  :  slope  of  segement  of  qjC  between  and  m^. 

Ij  :  set  of  row  indices  of  Mj. 

We  now  have  the  following  theorem. 

Theorem  4.1:  Suppose  A(e)  satisfies  Assumptions  4.1-4. 4,  and  has  a 
skeleton  that  forms  a  BDCM  of  one  cycle.  Then  the  scaling  matrix 

S  =  diagte5-'-, e31*}  (4.5) 

causes  SAS-1  to  have  MSSNS,  where  s^,....,  sN  are  defined  by 

S  j  F  if  1  ^  X  j  f 

SN  =  °* 


Proof: 

According  to  Assumption  4.4,  the  M^  are  contiguous.  Now,  consider 
M-^  and  M2.  As  a  consequence  of  Assumption  4.4,  M]_  is  contained  in  M2 
and  each  entry  a^j  of  Mj,  i>j,  must  belong  to  a  contiguous  principal 
submatrix  M[i,j]  containing  aj,j+i»  •— *'ai-l,j^  which  are  the  elements 


of  the  canonical  diagonal  of  the  original  matrix  A  and  have  orders 
aj'aj+l'***'ai-l  resPectively*  We  then  have 

0(ai;j)  +aj  +  aj+1  +  ....  +  a^  2  (i-j+Ufp  i>j 

The  scaling  defined  in  this  theorem  satisfies 

sj"sj+l  =  fI-aj' . '  si-l"si  =  fl~ai-l 

Denote  the  order  of  the  (i,j)-th  entry  after  scaling  as  O(a^j). 
Considering  the  fact  that  the  diagonal  scaling  does  not  change  the 
principal  minors,  we  have 

0(Sij  )+(aj+Sj-Sj+1)  +. . .  .+(ai_1+si_1-si) 

=  0(aij)+(aj+f1-aj)+....+(ai_1+f1-ai_1)  >  (i-j+l)fx 
or  O^j)  2  fp  i>j  (4.6) 

The  entry  a^j,  i<j,  in  the  upper  right  triangle  of  M2  has  an  order 
equal  to  or  higher  than  f^  before  scaling.  After  scaling,  for  i<j,  and 
considering  fpjaj/  i=l,...rN-l,  we  have 

0(ai;j)  =  0(ai;j)+S2"sj  =  0(aij)+(si-si+1)+....+(Sj_1-Sj) 
=0(aij)+(j-i)f1-ai-...-aj_1 
2  fx,  i<j  (4.7) 

Also  0(3^)  =  0(122)  ^  fl  (4.8) 

Combining  (4.6)— (4.8)  we  can  conclude  that,  after  scaling,  every  entry  in 
M2  has  an  order  equal  to  or  higher  than  f2* 


Now  consider  the  entries  contained  in  M2  but  not  in  (see  Fig. 
4.5).  Using  the  same  argument  as  before  and  the  Assumption  4.3 ,  we 
have  for  a^,  i>j,  i*llf 

0(aij)+aj+....+ai_1  2  tf1+(j-i+l-t)f2  (4.9) 


Figure  4.5 


where  t  is  the  overlap  of  and  M[i,j],  Since,  after  the  scaling 
defined  in  this  theorem,  there  will  be  exactly  (j-i+l-t)  canonical 
diagonal  elements  of  M[i,j]  having  order  f2,  and  t-1  having  order  f^, 
from  (4.9)  we  have 

0 (a^j )  2  fi»  i>jf  i^-Ii  (4.9) 

Arguments  similar  to  those  used  in  deriving  (4.7)  will  prove 
0(lj,  j)2fp  i<j,  iSIj.  Repeating  this  procedure  for  a^j,  i6I2\I^,  we  can 


I." 


Ody)  2  f2»  i€  I2\Ii 

Ihis  procedure  can  easily  be  extended  to  M3,  M4,  ...  and  finally 
we  can  show 

OSy)  2  £k' 

Since  there  is  one  set  of  ixi  minors,  i=l,...,N,  whose  orders  are 
exactly  (the  a^i+1,  iti^Xi^),  the  theorem  is  proved. 

The  following  example  illustrates  our  approach. 

Exanple  4.1  Consider 

e3  e3  © 

©  e3  e3  e 

A(e)  = 

e3  ©  c2  e7 
e6  e®  (e^)  e7 

Before  applying  Theorem  4.1,  we  have  to  apply  steps  1-3  to  A(e).  First 
we  identify  the  only  skeleton,  which  comprises  the  circled  elements  in 
the  matrix  above,  and  corresponds  to  the  permutation  (1,2, 3, 4).  To 
transform  this  permutation  to  canonical  circulant  form,  we  use  the 
symmetric  permutation  9t=(l,3).  Then 
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A]_(C)  =  TA(e)T”^  = 


e5  e4  c3  1 


e®  e®  e®  e7 


This  matrix  has  ai=l,l,0f6,  and  bj=2,2,2,2.  In  other  words,  m-j=4,  f-|=2. 
Henc^  using  Theorem  4.1,  we  have  s-j_-S2=f i-a^l,  S2-s3=f2-a2=l,  s3~ 

s4=f2-a3-2,  or  S2=4,  s2=3,  s3=2,  s4=0.  This  leads  to 


A2(g)  =  SfeJA-^eJS^r1  =  STA(e)T"1S~1  = 


e2  e2 
g2  e3 
e3  e3 


G2 


e3 


e5  e11 
g2  e4 
g2  e2 


It  is  easy  to  check  that  A2(c)  has  MSSNS,  since  its  eigenvalues  (which 
are  the  same  as  those  of  A(c))  and  invariant  factors  have  the  same 
orders. 


4.2.4  Scaling  For  Matrices  With  Two-Cycle  Skeleton 

We  have  derived  the  scaling  of  Theorem  4.1  for  the  case  where  the 
skeleton  only  has  one  cycle.  Now  we  shall  extend  this  result  to  a  more 
general  case  where  the  skeleton  has  two  cycles.  From  the  results  in 
Section  4.1,  we  know  that  after  some  symmetric  permutation  A(e)  can 
always  be  brought  to  the  following  form 


where  the  skeletons  of  A11  and  A22  are  BDCM  with  one  cycle  and 
A-q  is  an  mxm  matrix.  We  then  have  the  following  lemma. 

Lemma  4.1; 

If  A(e)  has  the  form  of  (4.10)  and  satisfies  Assumptions 
4.1  and  4.2,  then 

1.  a^  =t>2=  *  ♦  •  *  =^nv=l^Yn  =  k  (4.11) 

2.  ciji.a^,  j=m+l, . . . .,  NH1,  i=l, .... ,m.  (4.12) 

£l2Qf; 

Fran  the  assumptions  of  this  lamia,  we  know  that  Ajj  is  a  BDCM  with 
one  cysle  and  that  its  determinant  is  a  principal  minor  of  order 
a^+.^.+ajj,.  But  any  mxm  principal  minor  must  be  equal  to  or  larger  than 
pmc=bi+b2+....+bm.  Therefore 

aj+... ,+ajj,  2  (4.13) 

On  the  other  hand,  from  Assumption  4.1  we  know  that 

ajibj*  i,j=l,....,m-l  (4.14) 

Combining  (4.13)  and  (4.14)  we  have  (4.11).  Using  Assumption  4.1  and 
(4.11)  we  have  (4.12). 


From  now  on  we  shall  suppose  that  A(c)  has  the  form  of  (4.10)  and 
satisfies  (Assumptions  4.1  and  4.2,  and  therefore)  (4.11)  and  (4.12). 
The  required  scaling  for  this  case  will  be  stated  in  the  following 
theorem.  In  order  to  prove  this  theorem,  we  need  to  augment  Assumption 
4 3  by  the  following  Assumption  43a.  We  also  introduce  two  new 
assumptions.  Assumption  4.5  and  4.6. 

Ass.unpt.ion  1.3a 

Suppose  the  order  of  an  mxm  contiguous  principal  sutxnatrix  is 
pmc.  If  there  is  an  sxs  principal  submatrix  whose  closure  M2  of 
dimension  r  overlaps  Mlr  and  if  the  overlap  is  t,  then  the  order  of 
satisfies 


°(Nl)  2  bnrt-r-t-s+i:bfr-t-s+l+*  * •  ,+fcta+r-t-l  ^ 


where  the  sum  above  that  we  have  denoted  by  h  is  as  marked  on  the 
figure. 


Figure  4.6 


m+r-t-s 


A(e)  has  the  form  of  (4J.0)  and  the  diagonals  contained  in  A11  and 
A22  determine  the 

This  assumption  implies  that  there  is  at  least  one  diagonal  of 
order  that  is  contained  in  Ajj.  and  A22  and  has  the  order  of  for 
each  i. 

To  simplify  our  proof,  we  impose  the  following  assumption. 
Assumption  1*1: 

The  contain  am+1^nH.^  for  i=2,3,.... 

Note;  We  believe  that  without  this  assumption  the  proof  still  can  be 
extended  but  the  notation  will  get  very  complicated. 


Theorem  1*2: 

Let  M^,  m^,  Ij  and  f^  be  defined  as  before.  Suppose  A(e)  satisfies 
Assumptions  4.1-4.6  and  4.3a.  Then  the  scaling  matrix  is 


S  =  diagfe31,....,  esN} 


(4.15) 


3i"si+l  ={' 


xr  i=m 


f j-aif  i*m  and  i*Ij 


(4.16) 


where 


x  =  max  {bi  -  Z  (ak-bk)  ^(a^) } 


(4.17) 


and 


A^c) 


■  TA(e)T-1  = 


©  e2  e2  e2 
c  (2e)  e2  c2 
lie© 


?  c2  (?)  e^J 


Here  aj=l,l,0,2,  bj*l,l,l,l.  In  the  following,  we  shall  repeatedly 
aplly  Theorem  4.2  to  A-^(c).  Since  the  skeleton  forms  a  BDCM  with 
three  blocks,  we  start  from  the  lower-right  3x3  principal  submatrix, 
which  consists  of  two  blocks.  For  this  submatrix,  m=l,  b^=l,  1,  1, 
a^*l,  0,  2,  b=b^*l,  J^={2,  3},  J2={1}» 

i-1 

x  *  max  {bi-Z  (ak-bk)  "  °(*ii>}  =  1»  9=2,  h=l. 

if  {2,3}  1  k*2  *  *  13 

j*l 


Therefore  a  scaling  for  this  3x3  principal  submatrix  of  A^(e)  follows 
from  the  calculation 


il"i2®°'  i2-i3*”xs>'lf  i3-i4=b3-a3=l. 
or  i1=i2*i4=0,  ij*!. 


Applying  this  scaling  to  A1(e),  we  obtain 

A 2(e)  =  diag{l,l,e,l}A1(e)diag{l,l,e_1,l} 


Now,  apply  Theorem  4.2  to  A2(e)  again.  Taking  the  (1,1)  entry  as 
and  the  complementary  3x3  principal  submatrix  as  A22  of  Lemma  4.1,  we 
see  that  A2(e)  satisfies  the  requirements  for  Theorem  4.2.  Therefore  we 
have  idpI,  J^={2,3,4},  and 

x  =  max  -  0(a^j )  }=0, 
i/j 

So  the  scaling  remains  the  same,  i.e. 

S  =  diag{l,l,e,l} 
and 

A2(e)  =  SA-^cJS"1  =  STA(e)T_1S“1 


which  has  a^*b^=l, 1,1,1.  Therefore  A2(e)  has  MSSNS. 

Now,  let  us  prove  Theorem  4.2. 

£l 

According  to  Theorem  4.1,  the  scaling  defined  in  (4.15)-(4.17)  will 
ensure  that  the  orders  of  the  entries  in  the  i-th  row  of  An* (where 


A^j 1  denotes  the  submatrices  after  scaling)  have  order  equal  to  or 
larger  than  and  the  orders  of  a 12'fw  311(3  aml'  wb3-cbr  in 

this  case,  are  equal  to  fp  The  entries  in  i-th  row  of  A22*  have  order 
equal  to  or  larger  than  and  the  orders  of  m+2,'**M'aN-l,N'  311(3 
aN  m+l'  are  e(3ual  t0  bm+l',,**'bN*  Now  let  us  check  tbe  orders  of  the 
entries  in  A21'  and  A^'. 


It  is  easy  to  see  that  after  scaling  the  order  of  the  (i,j)-th 
entry  in  A2]/  is 

i-1 

0(1^)  =  Ofa^)  +  Z  Ca^-fc^ )  +  x 
J  J  k=mfl 


i-1 

=  O(a^j)  +  Z  (a^-b^) 
k=nH-l 

i-1 

+  max  {bjKKai-i)-  Z  (a^-b^)  ^ 
i^J^  k-jii+1 

itJ2 

>  bif  i«Ji 


Also  there  exist  integers  g  and  h  such  that 


®(agh^  =  bg 


Now  define  d^j=O(aij)-0(a^j)  and  consider  the  entries  in  A-^'.  For 
columns  g+l,....,N  the  order  increment  d^j  are 


j“l 

dii  ■  -  Z  (a^-toj.)  -  x,  j=g+l,....fN 


£  Oy«k>  -  bg  +,  £  ,  <*"**>  +  °<agh> 

k=m+l  k=nrt-l 


21  (bk-^)  "  bg  +  O(a^) 
k=g 


=  ,21  (b|rak>  +  b  -  a  +  0(agh)  -  b 
k=g+l  *  * 


0(agh>*agr  bk>aRr  for  any  k 


Therefore 


dij>0 


CKa^')  =  0(ai:j)  +  dtj  l  O(aij)  l  fp  i=l,....,N, 

j=g+l,....,N 

For  the  remaining  part  of  A12‘,  i.e.  columns  m+l,....,g-l,  we  can  use 
the  following  arguments. 


In  this  area  a^j  and  ag^  must  be  contained  in  a  diagonal  of  some 
principal  minor  containing  only  canonical  diagonal  elements  of  A-q  and 
A22  apart  from  a^j  and  agh.  In  other  words,  if  i£h,  the  diagonal  can  be 
expressed  as  (g,h,g+lr....,i-l,i, j,j+lf....,g”l)f  and  contains  h+g-i-j+2 
elements.  Since  the  principal  minor  is  enclosed  by  M[g,h],  its  order 
must  be  (according  to  Assumption  4.4)  equal  to  or  higher  than 


9 

E  bu 

k=i+j-h-2 


Therefore,  it  is  easy  to  check  that  0(a^j,)>f1  after  scaling.  For  the 
case  where  i>h  we  can  apply  similar  arguments. 

Hie  above  scaling  theorems  have  been  obtained  under  rather  strong 
assumptions.  It  is  therefore  important  to  know  that  they  are  applicable 
in  certain  situations  of  interest.  In  the  next  section  we  show  how  to 
apply  our  results  to  the  scaling  problem  in  a  high  gain  feedback  system. 


4.3  Scaling  in  High  Gain  Feedback  systems 


The  problem  discussed  in  this  section  is  the  time  scale  analysis 
of  feedback  systems.  In  particular,  we  consider  a  high  gain  feedback 
control  problem,  of  the  type  studied  by  Young,  Kokotov ic  and  in  more 
detail  by  Sanutti.  [14],  [15],  [13].  We  shall  see  that  the  methodology 
developed  in  the  previous  subsections  serves  to  extend  the  scaling 
results  of  these  references  to  a  more  general  case. 

To  simplify  our  discussion,  let  us  restrict  ourselves  to  the  simple 
situation  proposed  in  [14],  where  an  N-state,  m-input,  m-output  system 
with  output  feedback  is  considered: 

A  AA  AA 

x  =  Ax  +  Bu 

A  A  A 

y  =  Cx  (4.18) 


u  =  K(e)y 


with  A,  B,  C,  K(e)  being  matrices  of  appropriate  dimensions 


It  is  assumed  that  system  (4J.8)  satisfies 

A  A  ■  4  A 

CA1  B=0 ,  i=l,...,q 

CA^B  is  nonsingular  for  some  q>l. 

Under  this  assumption,  there  is  a  linear  transformation  to  transfer 
(4J.8)  to  the  following  form  [14], 


or 


xo" 

xn 

Aq  Aqi  0 

1 

• 

s 

0  0  * 

• 

;  i 

K 

y 

AqO  ^1^2 . Agq 

=  [0  I  0  ••••  0]x 

u 

=  K(€)y 

x0 

y. 

Aq  Aqj^  :  0 

xl 

• 

= 

0  0  : 

9 

:  1 

^  1  *q2  . 

xo 

0 

xi 

• 

• 

• 

+ 

• 

• 

*q 

Bq 

»  - 

x0 

X1 


Suppose  that  we  have  a  high  gain  feedback  problem  where 
K (e)  =  e“^I  *  p^I,  p=€~l 


Wl 


If  we  scale  A(e)  by  (which  corresponds  to  time  scaling)  and  denote 
the  resultant  matrix  as  A^(e),  then  we  see  that  A^(0)  does  not  have 
SSNS.  Therefore  a  scaling  is  necessary  in  order  to  have  a  standard  two- 
time-scale  form. 

We  can  also  see  that  A1(e)  is  in  the  block  explicit  form  with  one 
cycle  (i.e.  the  same  form  as  that  in  the  previous  sections  but  now  each 
entry  is  a  constant  matrix  multiplied  by  a  power  of  c).  Applying  the 
scaling  in  Theorem  4.1  but  treating  each  block  as  one  entry  of  matrix 
A(c)  in  the  theorem,  we  can  see  that  a  scaling 

S  =  diag{pc3l,p<3l,p<3-1I,p<2'"2I,...,I} 
will  bring  A(e)  to 

*01 

Ax(e)  =  SA(e)S_1  *  0  0 


P*%0  P^Aql  P^%2 


*0  ”01 


e_1I  ec?+1Aq0  ec3+1Aq1  e^2 
_  _  +Bc3 


=  D(e)S(c) 

As  in  Chapter  3,  we  check  the  Schur  complements  of  £(0)  to  see  if  A^ 
satisfies  the  MSSNS  condition  or  not.  Note  that  there  are  two  time 
scales  in  this  example  so  we  need  to  check  Aq  (  i.e.  the  A^  in  Chapter 
3)  and  the  Schur  conplement  of  Aq  (i.e.  the  A22  in  Chapter  3).  Here  we 
only  consider  the  case  where  Aq  is  of  full  rank.  Otherwise  the  scaling 
will  be  too  complex.  Since  Bq  is  full  rank,  we  can  see  that  the  two 
Schur  complements  are  full  rank.  Therefore  A^e)  has  MSSNS. 

Now  consider  a  more  general  cases  where  K(p)  is  an  analytic 
function  of  p  at  p  =oo .  This  is  a  case  that  has  not  been  considered 
before  in  the  literature. 

Let  us  first  consider  the  (^t^Jx^+nj)  submatrix 


Ag+K(p)Bq  x. 


of  A(e) .  Now  suppose  the  Smith  form  of  Ag+K(p)Bq  is 


A^KtpJBg  =  P(p)D(p)Q(p)  =  Pdiag{llfpl2, . }Q 


where  P(p)  and  Q(p)  are  uniirodular.  Then  by  a  linear  transformation 
get 


=  diag{Q,I,I,P“l}  A  diag{Q-1,I,l,P} 


0  Q 

.  I  0 
•  • 

.  0  I 

0  P 

D(p)  x . x 


To  see  the  necessary  scaling  for  Ap  let  us  consider  an  8x8  example. 


Example: 

Let  Q= 

Then 


0  a 
b  0 


P= 


0  e 
f  0 


Ai  = 


D(p)  = 


p4  0 
0  p8 
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0 

0 

0 

0 

0 

0 


0 

0 

0 

0 

0 

0 


1 

0 

0 

0 

0 

0 


0 

1 

0 

0 

0 

0 


After  some  synmetric  transformation  we  have 

0  a  i 


0 

0 

0 

f 

0 

0 


This  is  a  two-cycle  case.  Applying  Theorem  4.2,  we  have  the  scaling  for 


V 

S'  =  diagfp9,?8,?7,?6,?6,?4,?2,!} 

0  a  0  0 
0  0  10  0 
0  0  0  f 
10  0  0 

0  b  0  0 
0  0  10 
0  0  0  0  e 

10  0  0 

which  has  MSSNS.  It  is  easy  then  to  derive  the  scaling  for  the  original 
system  (siirply  keep  the  (1,1)  block  unchanged)  and  find  that  the  system 
has  three  time  scales.  Prom  this  example  we  notice  that,  unlike  in  the 
simple  case  considered  just  prior  to  it,  the  scaling  is  determined  by 
the  structure  of  Q(p)  and  P(p).  Therefore,  in  general,  if  K(p)  is  a 
function  of  p,  one  should  first  apply  the  Smith  decomposition  on  K(p)  to 
find  Q(p)  and  P(p)  and  then  use  the  scaling  approach  in  the  previous 
sections  on  to  obtain  the  necessary  scaling. 

From  the  results  in  this  section  we  see  that  the  algebraic  approach 
developed  in  this  thesis  not  only  gives  a  clearer  picture  of  the  scaling 
proposed  in  [14],  e.g.  the  justification  of  the  scaling  used,  but  also 
provides  a  tool  to  treat  the  more  general  case  where  K(p)  is  an  analytic 


function  of  p, 


4.4  Conclusion 

In  this  chapter  we  have  discussed  the  situation  where  the  MSSNS 
condition  is  violated.  We  first  observe  that  the  lack  of  MSSNS  is 
equivalent  to  inequality  of  the  orders  of  eigenvalues  and  orders  of 
invariant  factors.  This  suggests  the  use  of  a  non-unimodular  similarity 
transformation  that  keeps  the  eigenvalues  the  same  while  changing  the 
invariant  factors.  We  focus  attention  on  a  special  transformation, 
namely  scaling,  i.e.  a  diagonal  similarity  transformation  matrix  with 
the  diagonal  entries  being  powers  of  e. 

In  general,  diagonal  scaling  may  not  be  successful  in  inducing 
MSSNS  if  applied  to  the  system  matrix  A(e)  directly.  Our  procedure 
requires  us  to  first  transform  A(e)  to  its  Smith  form  A^(e).  We  then 
identify  a  skeleton  in  A1(e),  as  defined  in  Section  4.1.  The  next  step 
is  to  apply  a  symmetric  permutation  to  A^(e)  so  that  its  skeleton  forms 
a  block  diagonal  circulant  matrix.  The  diagonal  scaling  can  then  be 
derived  fairly  easily  from  this  form,  under  certain  assumptions. 


Several  points  should  be  made  here. 

1.  The  scaling  is  not  unique.  This  is  because,  in  the  first  place, 
the  skeleton  is  not  unique.  For  example 


A(e) 


5e  c  e 

e  c  c?  e 

e4  e  e  1 

fp-  eP  e4  e4 


has  two  skeletons.  One  is  a22,  a13,  a41,  a34,  which  has  the  structure 
(2)  (3,1,4).  Another  is  a12,  a23,  a34,  a41,  with  structure  (3, 2, 1,4). 
For  the  first  one,  we  need  a  symmetric  permutation  (2,1).  After  the 
synmetric  permutation  we  have 


Ax(e) 


2 

e  sr  e 
5e  e 

e4  e  1 
e2  e4  e4 


The  scaling  for  A^c)  is  S=diag{e,e,e,l}  according  to  Theorem  4.2 
Therefore  the  overall  transformation  is 


On  the  other  hand,  for  the  second  one  the  scaling  is  simply 
S2  =  diag{e,e,e,l}  =  S 

2.  The  approach  stated  in  this  chapter  is  of  course  not  the  only 
possible  one.  For  example,  suppose 

A(e)  = 

Let 

Then 


is  in  explicit  form  and  the  scaling  is  S=diag{e,l}.  The  overall 
transformation  is 


By  inspection,  however,  we  see  that  S2  =  diag{en,  1}  is  also  a  candidate 


scaling  for  the  original  system,  because 


c  e11  1  e0-^- 

A1  ~  s2as2~1  =  =  e 

0  e  0  1 


where  n>l  and  A^  has  MSSNS. 


3.  In  general,  a  diagonal  scaling  may  not  be  sufficient  because 
diagonal  scaling  will  not  change  the  principal  minors  of  A(e).  As 
pointed  out  earlier  in  Chapter  3,  the  no  cancellation  condition  is  a 
necessary  condition  for  A(e)  to  have  MSSNS.  Therefore,  if  this 
condition  is  not  satisfied,  i.e.  if  there  are  cancellations  among  the 
lowest  order  principal  minors  such  that  the  order  of  the  gcd  of  the 
principal  minors  is  not  equal  to  the  order  of  the  sum  of  the  principal 
minors,  then  diagonal  scaling  will  not  work.  The  reason  is  that  the 
diagonal  scaling  proposed  in  this  chapter  does  not  change  the  principal 
minors,  so  that  after  scaling  A(e)  still  does  not  have  MSSNS.  Note  that 
this  cannot  be  remedied  even  if  we  first  transform  A(e)  into  its 
explicit  form  Ag(c),  as  we  have  done  in  this  chapter,  since  the  explicit 
form  does  not  ensure  this  condition. 


4.  We  have  mentioned  at  the  begining  of  this  chapter  that  scaling 
firstly  provided  a  way  to  find  an  approximation  of  the  eigenvalues 
without  confuting  than.  The  second  role  of  the  scaling  is  to  transform 
a  system  that  does  not  have  MSSNS  to  a  system  that  does.  We  can  then 
(if  the  scaled  system  satisfies  MSST  as  well)  find  a  time  scale 


decomposed  system — denote  its  system  matrix  by  A(e) — to  approximate  the 
scaled  new  system  using  the  procedure  described  in  Chapter  4.2. 
Intuitively,  if  we  apply  a  reverse  scaling  (i.e.  S-1)  to  this  time-scale 
decomposed  system  we  should  obtain  an  approximation  of  the  original 

A 

system.  Our  conjecture  is  that  the  difference  between  exp{A(c)t}  and 
S-1exp{A(e)t}S  must  have  an  order  higher  than  the  order  of  exp{A(o)t}, 
i.e.  the  norm 

I |exp{A(e)t}-S’*exp{A(e)t}S| I 

must  have  an  order  higher  than  that  of  |  |expA(e)t}  1 1.  Mathematically, 
it  may  be  true  that  the  norm 

I  li-expf-AfeJtjS^expfAteJtJSl  I 

approaches  zero  as  e  goes  to  zero,  but  further  study  of  this 
possibility  is  left  for  further  work. 

5.  Assumption  4.4  can  actually  be  derived  from  a  revised  version 
of  Assumption  4.3,  which  constrains  the  orders  of  the  overlapped 
principal  submatrices  (possibly  several  in  contrast  to  two  in  Assumption 
4.3).  Unfortunately,  the  revision  of  Assumption  4.3  is  almost  impossible 
to  check  in  a  practical  situation  and  the  subsequent  derivation  of 
Assumption  4.4  is  also  very  complicated.  This  is  why  we  have  stated 
Assumption  4.3  and  4.4  as  two  independent  assumptions  in  this  chapter. 


Several  examples  are  given  in  this  chapter,  including  the 


5.1  Contributions  of  Ibis  Thesis 


Hie  principal  contribution  of  this  thesis  is  the  development  of  an 
algebraic  approach  to  multiple  time  scale  decomposition  of  perturbed 
linear  systems.  Based  on  this  approach,  we  have  been  able  not  only  to 
obtain  a  more  direct  and  simple  description  of  the  multiple  time  scale 
decomposition  of  such  systems  but  also  to  handle  number  of  problems  that 
are  difficult  to  solve  using  previously  developed  results. 

In  Chapter  2,  using  this  approach  we  make  clear  the  connection 
between  Kokotovic's  explicit  two  time  scale  approach  and  Coderch's 
multiple  time  scale  results  and  in  the  process  we  provide  a  far  more 
straightforward  procedure  than  that  of  Coderch  that  makes  clear  the  role 
of  Smith  decompositions  and  the  Schur  complements  of  a  certain  matrix. 
Using  this  machinery  we  are  then  able  to  extend  the  multiple  time  scale 
result  to  a  larger  class  of  systems  that  satisfy  the  MSSNS  but  not  the 
MSST  condition  and  to  solve  a  feedback  time  scale  assignment  problem  for 
the  system  *=A(e)x+B(e)u  when  A(e)  and  B(e)  are  coprime. 

In  Chapter  3,  the  further  study  of  the  algebraic  structure  of  A(c) 
proved  useful  in  deriving  equivalent  conditions  for  MSSNS  and  MSST.  In 
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Chapter  2  we  had  shewn  that  the  MSSNS  (MS ST)  condition  is  equivalent  to 
the  successive  Schur  complements  of  a  particular  matrix  being  full  rank 
(Hurwitz).  In  Chapter  3  we  proved  that  the  MSSNS  condition  is  also 
equivalent  to  the  eigenvalues  and  invariant  factors  of  A(e)  having  the 
same  orders.  This  result  not  only  simplifies  the  interpretation  of  the 
MSSNS  condition  but  also  provides  a  new  and  perhaps  the  only  practical 
criterion  to  actually  verify  this  condition.  Related  to  this  result  is 
the  relationship  between  the  MSSNS  and  MSST  conditions.  We  showed  that 
A(e)  satisfies  the  MSST  condition  if  and  only  if  it  satisfies  the  MSSNS 
condition  and  the  orders  of  the  real  parts  of  its  eigenvalues  are  equal 
to  or  less  than  those  of  the  corresponding  imaginary  parts.  Therefore, 
to  check  the  MSST  condition  we  may  first  check  the  relationship  between 
the  orders  of  the  real  and  imaginary  parts  of  the  eigenvalues  of  the 
matrix.  If  they  satisfy  the  condition  we  have  just  mentioned,  then  we 
can  proceed  to  check  the  MSSNS  condition.  The  final  and  perhaps  most 
important  contribution  in  Chapter  3  is  the  clarification  of  the 
relationship  among  the  principal  minors  of  A(e),  the  orders  of  the 
invariant  factors,  and  the  MSSNS  condition.  It  is  shown  in  this 
chapter,  that  unlike  the  usual  cases,  if  the  system  satisfies  the  MSSNS 
condition,  then  the  orders  of  the  ged's  of  the  principal  minors  of  each 
size  instead  cf  all  minors  of  each  size  determine  the  invariant  factors. 
Based  on  this  observation,  1)  an  algorithm,  related  to  the  Newton 
polygon,  was  developed  to  determine  the  invariant  factors  from  the  ged's 
of  the  principal  minors  when  the  system  has  MSSNS;  and  2)  an  equivalent 
condition  for  MSSNS  was  obtained  which  emphasizes  the  role  of  the 


principal  minors  and  a  so-called  "no  cancellation"  condition. 
Therefore,  if  a  system  has  well  defined  time  scale  behavior  (or  more 
generally,  A(e)  satisfies  the  MSSNS  condition)  then  we  can  use  this 
algorithm  to  determine  the  orders  of  the  various  time  scales.  Finally, 
if  the  system  has  MSSNS,  we  provide  an  approximation  for  the  eigenvalues 
of  A(e)  in  terms  of  the  eigenvalues  of  the  successive  Schur  complements. 
As  we  point  out  in  Chapter  4,  this  is  a  rather  general  result  because  if 
the  system  does  not  have  MSSNS,  we  may  be  able  to  apply  seme  scaling  so 
that  the  scaled  system  does. 

The  results  in  Chapter  3  are  used  in  Chapter  4  to  develop  an 
amplitude  scaling  procedure  to  transform  a  system  matrix  without  MSSNS 
but  satisfying  certain  conditions,  into  one  that  does  have  the  MSSNS 
property.  The  scaling  procedure  requires  first  identifying  a  skeleton 
of  the  matrix  to  be  scaled,  that  is  identifying  a  set  of  critical 
elements  to  be  scaled.  Ihen,  using  a  symmetric  permutation  to  transform 
the  matrix  to  a  particular  canonic  form,  we  can  determine  the 
appropriate  scaling  matrix.  Finally,  we  use  our  result  on  amplitude 
scaling  to  interpret  and  generalize  recent  results  on  time  scale 
analysis  of  high  gain  or  nearly  singular  optimal  feedback  systems. 

5.2  Suggestions  For  Further  Research 

1.  The  procedure  developed  in  Section  2.8  for  time  scale 
decomposition  of  a  system  with  MSSNS  but  not  MSST  requires  an  iterative 


algorithm.  A  bound  for  stopping  this  iteration  is  given  in  Section  2.8 
as  the  order  of  the  determinant  of  A(e)  multiplied  by  the  dimension  of 
A(e).  However  it  is  possible — and  is  typically  the  case — that  the 
iteration  can  be  stopped  much  earlier.  Therefore,  one  interesting 
question  is  to  find  a  tighter  stopping  point.  Related  to  this  is  the 
question  of  possibly  developing  a  more  efficient  procedure  for  time 
scale  decomposition  than  that  given  in  Section  2.8.  In  particular,  if 
the  system  satisfies  the  MSSNS  condition,  then  only  one  step  of  the 
iteration  is  needed  at  each  step.  Thus  if  all  of  the  successive  Schur 
complements  in  Chapter  2  are  Hurwitz,  we  are  finished.  However,  if  at 
any  stage  one  of  these  matrices  is  not  Hurwitz,  then  we  must  go  back  to 
the  beginning  and  do  the  full  set  of  iterations  at  each  stage.  It  would 
be  desirable,  if  possible,  to  obtain  a  procedure  with  a  more  recursive 
flavor  than  this. 

2.  Feedback  time  scale  assingment  is  discussed  only  for  the  simple 
case  where  A(e)  and  B(e)  are  coprime.  At  least  two  extensions  of  our 
result  are  of  interest. 

1)  The  consideration  of  the  time  scale  assignment  problan  that  we 
study  for  the  general  case  where  A(e)  and  B(e)  are  not  ooprime. 

2)  A  generalized  cheap  control  problem,  namely  the  problem  of 
minimizing 

J  *  J  x'Q(e)x  +  u‘R(e)u 


for  the  system 


x  =  A(e)x  +  B(e)u 


It  is  possible  that  the  Hamiltonian  matrix  for  this  problem  contains 
the  neccessary  information  for  the  time  scale  decomposition.  Also,  it 
would  be  desirable  to  make  contact  explicitly  with  the  amplitude  scaling 
results  of  Chapter  4  and  in  particular  with  their  uses  in  interpreting 
and  extending  recent  results  on  the  time  scale  structure  of  more 
standard  cheap  control  and  high-gain  feedback  problems,  i.e.  the  case  in 
which  only  R(e)  depends  on  e  and  in  fact  is  of  the  form  eR. 

3.  In  Chapter  4  we  derive  a  diagonal  scaling  procedure  under 
somewhat  restrictive  conditions.  The  reason  for  these  conditions  was 
primarily  due  to  the  absence  of  simple  methods  for  calculating  the 
invariant  factors  for  an  arbitrary  matrix.  That  is  we  have  had  to 
impose  conditions  so  that  the  scaled  system  matrix  has  a  special  form 
and  the  invar iat  factors  can  be  readily  determined.  Further  research 
could  be  directed  in  at  least  three  directions: 

1)  Finding  simpler  procedures  for  calculating  the  invariant 
factors. 

2)  Relaxing  the  conditions  that  were  imposed  on  the  matrix  A(e)  in 
order  to  verify  the  validity  of  the  scaling  approach  proposed  in  this 
thesis  work. 

3)  Considering  the  possibility  of  the  non-diagonal  scaling.  In 
this  thesis,  only  the  diagonal  scaling  of  a  system  in  so-called  explicit 


form  is  considered.  More  generally,  a  similarity  transformation  which 
changes  the  invariant  factors  but  not  eigenvalues  can  be  represented  as 
a  non  unimodular  matrix.  By  means  of  the  Smith  form  of  this  matrix,  the 
general  form  of  the  similarity  transformation  can  be  treated  as  a 
unimodular  transformation  plus  a  diagonal  scaling.  Therefore,  an 
interesting  question  is  what  role  can  be  played  by  the  unimodular 
transformation  before  diagonal  scaling. 

4.  In  Chapter  2  we  have  given  a  definition  of  the  well-defined  time 
scale  behavior  and  an  extended  definition  in  order  to  include  systens 
that  are  Hurwitz  for  some  e€(0,eg]  but  do  not  satisfy  the  MSST 
condition.  The  Hurwitz  condition  (or,  as  pointed  out  in  Section  2.8,  a 
slightly  weeker  conditin  that  allows  A(e)  to  have  some  zero  eigenvalues) 
is  a  requirement  throughout  this  thesis.  However,  intuitively  it  should 
also  be  possible  to  define  meaningful  approximations  for  some  unstable 
systems  as  well.  For  example,  the  systems  x=x+u  and  x=(l+c)x+u  should 
be  thought  of  as  being  close.  Although  the  difference  between  exp{t} 
and  exp{(l+e)t}  grows  without  bound,  the  leading  order  terms  in  the  two 
systems'  dynamics  are  the  same  so  that,  for  example,  any  feedback  u=-kx 
which  stabilizes  the  leading-order  term  of  one  system  does  the  same  for 
the  other.  For  this  reason  it  would  be  desirable  to  extend  further  our 
definition  of  well-defined  time  scale  behavior  so  that  a  much  larger 
class  of  systems  can  be  considered.  Also  another  possible  extension  of 
this  definition  is  proposed  at  the  end  of  Chapter  4  to  define  the  way  in 
which  an  amplitude-scaled  system  approximates  the  original  system. 


5.  It  has  been  pointed  out  in  Chapter  3  that  the  eigenvalues  of 
A(e)  are  clustered  into  several  groups  according  to  the  eigenvalues  of 
the  successive  Schur  complements.  It  is  possible  that  a  frequency  scale 


decomposition#  i.e.  the  decomposition  of  the  transfer  function  of  the 
system,  could  be  achieved  based  on  the  machienary  provided  in  this 
thesis. 

6.  As  we  have  seen  in  Chapter  3,  Section  3.5,  if  the  Jordan  form  of 
A(e)  does  not  have  nontrivial  Jordan  blocks  having  eigenvalues  of  order 
higher  that  0  and  if  the  similarity  transformation  matrix  M(c)  is  also 
unimodular,  then  A(e)  has  MSSNS.  However  as  the  examples  in  that 
section  also  have  shown,  the  reverse  is  not  true.  In  other  words,  the 
matrix  A(e)  can  have  MSSNS  but  at  the  same  time  M(e)  can  be  non- 
unimodular  and  there  can  be  a  nontrivial  Jordan  block  whose  eigenvalue 
is  of  order  higher  than  zero.  Further  research  is  required  to 
investigate  the  relationship  between  the  Jordan  form  of  A(e)  and  the 
MSSNS  condition  so  that  a  neccessary  and  sufficient  condition  can  be 
derived. 

7.  It  would  be  useful  to  develop  numerical  algorithms  for  the  time 
scale  decomposition  results  in  this  thesis.  The  crucial  problem  in  this 
regard  is  the  development  of  a  numerically  sound  procedure  for  computing 
the  Smith  decomposition  of  A(e).  Although  an  algorithm  based  on 
Verhgese’s  result  118]  can  be  constructed,  further  investigation  is 
still  required  to  find  a  well  posed  algorithm.  Note  that  thanks  to  the 
results  in  Chapter  3,  it  is  possible  to  derive  an  algorithm  for  directly 


Appendix  1 


In  order  to  obtain  a  recursive  method  for  the  computation  of  the 
Skj_  as  well  as  the  machinary  required  to  prove  Theorem  2.3,  we  introduce 
some  definitions  and  properties  related  to  the  Schur  complement. 

Consider  a  matrix  A 


A11  . Atn 

*n]  . V 

where  the  A^,  i=l  ,...,n  are  square  matrices.  Define  matrices  C^'j-  and 
,  as  follows 


The  following  property  states  the  relationship  between  +  for 


different  1. 


Eiapfitty  i 


fJCfl  a  pJCfl- 1 


“  Cif  j-l-k+2  (Cj-l-k+2)  ~tcj-l-k+2, 


Here  we  denote  Crr 


4c, 1 


Jj-1+1 ,3-1+1 


by 


for  convenience. 


Proof:  By  definition  (A1 .3) ,  (A1 .4) 


1J 


-rck, 1-1 


(Aii— Ai,j-iKW'  ■ 


gk , 1- 1 1 

i-l > S\ 


_  4c, 1-2  _  a  ,  -  >  4c,  1-1 

■  ‘  Ai,3-l-^2^3-l-k+2^j-l-rC+2 

[>,1-2- 

(Ail • • ,Ai,i-l ) (Yi-l )  ^ 


3^, 1-2 


-1  t 


Jr ,  1-1 


-  =  (f.1  ? 1  (  C  •  .  ,  5 

— j.— rC+2  ij  -  . 3*“l,“-'C+2  j-l-rc+2 


Ai-1 , j-l-k+2 


A1 , j-I-k+2  .  . 

•  ./,!,!  N 

:  '  '"J3-l-k+2  J 


Based  on  this  fact  we  can  prove  the  following  property  that  relates 
the  S^j  with 


Define 


S?"  - 

TC,1  "  A11B1,k+l-t 


(A1.6) 


.1 

o 


We  have 


.  1  /• -’I » 1  ,l,i-l+l 

*»k-i  "  v  "k-i ;  ii-i ,k+l- ‘ 


lit 


k-1 


^Ik+l  =  Ak,k+i  "  ±=1AkiSk^k-i 


In  particular,  when  1=0,  we  have 


c,1’1  -  7 


k-1 


'k,k  =  Akk  "  Akk  “  i_1AkiSk,k-i 

which  is  the  same  as  (2.27)  in  Chapter  2. 


,1 


Proof;  we  proof  this  property  by  induction.  Suppose  for  some  !<Li4c-2 
we  have 


k,\k+l  =  Ak,k+1  "  [Ak  1  *  *  * Ak ,k-i- 1]  [Yk-i- 1  ] 


-1 


1  ,k+l 


,1*1 ,i 
L  k-i-l 


-E 


a,  v  ,(c,1»!r1'~L+1»;3 


j  =  l  ^-0  -<-j,k 


xi 


Using  the  matrix  inversion  lemma,  (A1.7)  can  be  rewritten  as 


k-2,k-1* 


( c  ^  ^ ^ 

^  ''k-1  ' 


fk-1  ,k+l] 


=Ak,k+l“  ’  Ak  1 ,  •  *  h. ,k-2  j ( Yk-2}"*  i 


4 

A1 ,k+l 

A1,k-1 

• 

• 

t- 

• 

• 

• 

• 

Ak-2,k+l 

Ak-2,k-1 

-rj » ^ 


*-1 ,k+l 


r- 


•(c.':])-1 1  (a,  .  ....a.  ,hy.  .)->  1  ••+1  | 

A  1  l  *-1,1  *c-1,k-2/v  k-2'  ;  k-l,k+l 

A,,  -s  ,  J 


k-2  ,r£+l 


_C.1»1  _ r1+1 > 1 

u*-1,k+l“  °k-1,k 


k+1 


+A  ( r  *  "5 


k,k-1  v  "k- 1 


(Y„_,)_1  ’-A, 


•  •  •  Al 


k-1 ,T##Ak-1 ,*-21  k-2 


k-1 ,k+l '  (AM) 


A, 


k+2,k+l‘ 


Considering  (Al.l)  and  (A1.2)  we  have 


'"k-l ,k+l 


^k-1 ,k+l  “ 
Therefore  by  (A1.3) 


* 


M.i 

*k-1 ,k+l 


=  +  a 

rii,k+l  +  Ai,k-1 


„1  ,1 

"k-1 


_  T3^-  +  1  1 

"  ~i,k+l 


cl+1,1 

uk-1  ,k+l 


Inserting  (A1.10)  into  (A1.9)  we  have 


(A1.10) 


„1 ,1 
°k,k+l 


r  51+i  *i. 

1  !  1 »k+l 

Ak,k+i“[Aki**‘Ak,k-2](Yk-2r  I  • 

31+1 » 1 

-  k+2,k+l 


1,1  }-i 


C,1+1  >  1 

°k-1 ,k+l 


(Al.ll) 


which  is  the  same  as  (A1.7)  when  i»l. 


For  i*k-2  we  have  from  (A1.7) 


,-.1,1 


—  a  —  a  A^l-nl^l  >k-2 

k,k+l  “  Ak,k+1  Ak1A11ri1,k+l 


k-2 

2-  A,  ■  j 
3=1 


3 


*»■*■  J  J  ,  X 

This  completes  the  proof  of  (A1.7) . 


Property  2  will  be  used  to  prove  our  main  result.  Theorem  2.3.  For 


the  same  reason  we  need  the  following  property. 

Property  2.: 


rl,i-k+l 

''I r  A 

y/Z. 


,-.1+1  ,i-l 
'■'k-i ,k 


.1 ,i-l 
'k-i,k-l 


(  r»1  »  1 

^  '"k-l 


) 


1.1,1 

-*~1 


(A1.12) 


Proof: 

Suppose  (A1.12)  is  true  for  i»lM..rm-l,  we  prove  it  is  true  for 

i=*m. 

Let  us  consider  ,  .  leu-1 .  According  to  (Al.2)  and  (A1.5) 

y  J& 


1-1-111, k  “  'li-Ujk 


r>1*1  /,.1»1'\"“1  fJ- 1 1 

-k-ni,k-r  ^i-l;  .-i-l, k 


^1+1,1 

Ts-m.k 


.M  ,.1,1r1rl,1 
1i-m,k-l^  "k-l;  Tt-l,k 


Similarly  by  (Al.5) 


C,1^  =  (J-,2  _  r-T,1  /  r1  ,1  s-1.1,2 

.-i-m,!!  ^k-njk-1-1  <i-l-1  '  ^k-l-l  ,k 

Using  (Al.2)  to  find  an  expression  forc^’^  .  ,  we  obtain 


-1, 3  pl+1.1  _  -1,1  ,-1,1r1-l,1 

~  \s-m»k  °k-a,k-llwfc-l;  '■'k-l,k 


—1,1  /■  — 1  ,  1  \  —  1  -1+ 1,1 

Tc-m ,  1:-1- 1  ^  "k-1- 1 1  °k-l- 1 


il  1 1  /  — 1  ,  1  \  — 1  — 1  , 1  1,1^  — 1—1,1 

'■"k-m,k-l-1  ^  “k-1-1 ;  Ts-1-1  ,k-l^  Tc-l'  ‘'k-l.k 


- 1+1,2 

ut-m,k 


-1,2  /— 1,1  >  —  1-1,1 

wk-n,k-l^v'k-l;  °k-l,k 


Then 


-1,4 

nc-m,k 


-1,3  .  -1,1  /-1,1  n-1-1,3 

1:-a,k  -lc-m,k-l-2^uk-l-2;  °k-l-2,k 


_  -1+1,2 

"/k-in,ri 


,-1,2  < 

1c-m,k-l 


-  1,U 
1  ) 
X 


1-1,1  -1,1  ,-1,1  - -V-l+1, 2 

"'k-l ,k"  xC-m ,k-l-2 ^  T-:-l-2;  ^-1-2,1; 


+ 


-1,1  ( 

^K-Q,k-1-2V 


.-1,1  ) 

~k-l-2; 


1-1,2  c-1,1\~l-l,1 

°k- 1- 2 ,  k- 1 ' °k- 1 3  “'k- 1 ,  k 


..  r>l+ 1  ,3 

—  '-O «.  1  - 

n.— n 


-1,3  (  -1 , 1  \  —  1-1,1 

■k-m,k-lvnv-l;  ^c-l,k 


Similarly  we  have  for 

£\~Ul  f  a 


-l,m«l+1  _  -1,1  .  a-1r1»1  ,-1 ,1  >-1-1+1  ,3-1 

''k-m ,  k  "  '■'k-a  ,k  .  -  Tc-n ,  k-  j v  wk-  j '  v'k-  j ,  k 

— 1  ,  a**  1  ,  — 1,1\— 1— 1,1 

“  wk-m ,  k-1  ^  'Ic-l 1  ~k-l ,  k 

— 1  + 1 ,  a-l  —  1 ,  a— 1  ,  —  1  ,1  \™1  (—1 , 1 

=  ^k-a,k  "  xs-m ,k-l '  ^c-1 3  ^-l,k 


Thus  we  have  that  (At- 12.  )  is  true  for  i=a 


APPENDIX  2 


To  prove  Theorem  2.3  we  first  need  two  lemmas.  The  first  lemma 
provides  an  explicit  form  of  the  projection  along  the  range  space  onto 
the  null  space  of  a  specific  matrix  A.  It  also  displays  the  three 
equivalent  conditions  for  a  matrix  to  have  semi-simple  null  structure. 
This  lemma  is  important  because  the  results  in  [1]  are  based  on  the 
eigenprojection  of  zero  eigenvalues  of  the  matrices  F^0  (see  Section 
2.2  for  details). 

terma  1 


(A2.1) 


where  A^«  R0301,  A2  *  R^™1.  Suppose  that  [A^  A2]  has  full  rank.  Then 
the  three  statements 

1.  A^  is  nonsingular. 

2.  R(A)  &  N(A)  =  R13"*11. 

3.  A  has  SSNS. 

are  equivalent.  Furthermore,  if  A^  is  nonsingular,  the  projection  along 
R(A)  onto  N(A)  is  given  by 


It  is  easy  to  see  that 


R(A)  =  {x I  x=[z'  0]',  zSR11} 


If  x  €  R(A)  ,  x^O,  then 


Ax  = 


A1 

0 


=  A]_z  H  0 


Thus 

X^n(A)  ==>  N(A)  H  R(A)  =  0 
and 

N(A)  ©  R(A)  =  follows  by  counting  dimensions. 

2— >1 

If  R(A)®N(A)=Rn+m  but  A^  is  singular,  then  there  is  some  z^O  such 
that  A^z=0.  Let  x=[z'  0]'.  Then 


Ax  =  0  — >  x  €  N(A) 

But  because  [A-^  A2]  has  full  row  rank,  we  can  find  a  y  such  that 
[A^  A2]y=z.  Therefore 

Ay  =  [z1  0] '  =  x 


or  O^x  €  R(A)  Hn(A)  .  This  is  a  contradiction. 


2— >3  See  reference  [1],  [16]. 


Now  consider  the  similarity  transformation 


having  SSNS  is  equivalent  to  being  nonsingular.  Furthermore,  the 
projection  along  R(A)  onto  N(A)  is  given  by  (A2.2).  In  other  words,  the 
eigenpro j ect ion  of  A  for  the  zero  eigenvalue  can  be  computed  directly 
and  easily. 

Laima  2  computes  matrices  F^g* 


1 

Define 

pi,0  *  pipi-l“*p0 


(A2.3) 


where  is  the  projection  along  R(F^q)  onto  N(F^q)  « 


i-2,0 


i-]_ 


-s1 

bil 

,2 

-*i1 

-S1 

i2 

• 

~b±z 

• 

r°i, 

1 

i 

* 

0 

cn-j>1 

'Si2 

„n-i+1 


*1 


(A2.4) 


where  the  Sj^j  are  defined  as  before.  Also 


V 

J  «  •  • 

il ^i,i  il *  *  *  *  il ^i,n 

Q  1  ^*|1  J  1  Q  1  rtl  |  1  Q  1  1  ,  1 

bi2wi,i  bi2°i ,i+l • • • •bi2win 

•  •  • 

s>: 

•  •  • 

*«  **.  • 

•  •  • 

.1  .1,1-1  .1,1  .1  .1,1 
i,  j-1 vi,ibi,i  i, j+1 *  *  *  *°i, j-1 win 

(A2.5)  ‘:y> 

.1,1  .1,1  ,.1,1 
"i,i  win 

y 

0 

V'-.-J 

• 

• 

»"**  *  » 

,< 

V. 

J  %  0  •  V 

)  6 

Proof: 

Let  the  Taylor  expansion  of  A(c)  be 
A(c)  »  Fq0+cF01+c2F02+... 


Fran  the  special  form  of  A(c)  we  can  observe  that 


Similarly, 


pi,i-l 


x  x 

X  x 

Jil  Ji 2 

0  0 

0  0 


where  x's  represent  some  terms  which  may  be  non-zero. 


Now  suppose  that  (A2.4)  is  valid  for  i£k.  We  would  like  to  prove 


that  if  is  valid  for  i*k+l.  Fran  the  special  form  of  (A2.4),  namely  the 
first  i-1  columns  are  zero  vectors,  and  the  special  form  of  F^g  shown 
in  Coder  ch's  thesis  [1]  we  can  observe  that 


Fk-1,0  *  pk-2pk-3*«**p0F0fk-lp0*‘“plc-3plc-2 

*  pk-2pk-3 •  •  •  •pOF0,k'-lpk-2pk-3 ••••p0 

*  pk-2/0p0,k-lpk-2,0 


(A2.6) 


The  other  terms  in  Fj^  g  which  are  related  to  Fo,k-2'  p0,k-3'* 
vanish  because  of  the  special  forms  of  P^-2,0  2111(3  F0fi»  i*k“2,k-3M 
Also  can  be  rewritten  as 


Fk-1,0  “  pk-2,0E0,k-lpk-2,0 


where 


^rk-l 


0  0 

Jkl  ^k2 

0  0 

0  0 


Then  using  Eq. (A2.2)  of  Property  2  we  have 


(A2.6) 


0...0 


Fk-1 ,0=?k-2,0  : 


1  n  1  1  ril  » 1 

Pkk  nr,k+1  •••wk,n 
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•-L 
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-  *  *  *  -  ""kl^kk  “bk1Gk,k+1 •***“Sk1Ckn 


k , k- 1 vk,k  *k,k-1 ^k,k+1 *  *  * ‘  5k,k-1 vk,n 


-1 ,1 


0...0 


,cl  ,  1  .1,1 

Tc.k+l  -  'Jk,n 

0 


0 


(A2.7) 


which  proves  (A2.5) . 

Next  we  prove  that  the  projection  Pj^  which  is  the  projection 
along  onto  NtF^^gi  La  o£  the  form 


'kl 


(.'■'1>^'\”1p1»1  ql  /  .1  ,  1  \  —  l^l.f 

bk1  ^  "'kk  '  ^k.k+i  •  •  •  *^k1  ^kk 


=4 

k,k-1 


kr^kk  '  ~k,k+1 

.1  ,.i,iN-i.i,i 


K.n 


-  (  G,1  *  1  1"  ‘  "  ’  >'  /  .1  ,1  \  .  1  , 1 

k,k-r-kK  '  wk,k+T***sk,k-1(okk  )vk,v 


-^d'r'4:L . 


(A  M) 


It  is  easy  to  see  that 


Therefore  is  a  projection.  Also  it  is  easy  to  check  that 

Pk-lTk-l ,0  "  0 

Then  if  we  can  prove  that 


*«N<Fk-r,o>  “>  Vi*8* 


we  are  done.  Suppose 

pk-1,0*a°'  *'  *  <xt . V' 


We  have 


§0  ^/k+l^+i  *  0 


(A2 


Then  let 


y  *  pk-ix  *  tyi'-Vi' 


we  have  for  i£k-l 


Yi 


V\,i 


n-k 


Because  of  (A2.9) , 


ui  ■  *i,  i^H-1 


For  i«k 


Y  -  -  (  C.^  *  ^  ^  H-r-k ,  1  -., 

*  '  u<*  5 


(A2.10) 


Left  multiplying  on  both  sides  of  (A2.9)  and  adding  the 

resulting  equation  to  (A2J.0)  we  have 

Yk  *  xk 

It  is  straightforward  then  to  see  that 
y^  »  xif  n>12k+l 

Therefore  we  have  proved  that  if  x  then  This  proves 

that  the  geven  in  (A2.8)  is  the  projection  along  R(Fk-l,(P  onto 

N<Fk-l,0> • 


Then  using  (A2.4) ,  (A2.8)  and  property  3r  Elq.  (A2.7) 


k-1,0  ~  k-1 ' k-2,0 


0...0  -sfr.+s*  ( c,L* * ) 


~1 ,1 

~n-k+1 

v'k,k+1  * 

1„1 ,1 

-,n-k+1 

~k,k+1 

“O,.  s 

A»1 

+3‘  (C.'» 

u  I  4) 


+s. (0,  ! 

^.1  -V  , 


1  -k,k;  <t,k+1 


^  °t .-lr  '  '-'V 


ax 


1  . 


*  1  . 


By  definition  (A1.6)  and  Property  3,  (A2.7) 


V  ...  yrt  ^k,k+1 


XI  XX'  X,X' 


—  ( » 1  \  - 1  r2  ,k-i-1  /  ,1  >  —  1  p,  1  >k-i  ✓  pi  ,1  ^  —  1  _  1  , 

U  ii;  -i,k  +  uxk  J  ci,k  (okk  5  °k;; 

_(  ^  ^  r ^  >  A— i*»  1  ,  pi  ,k-i,  ,1  \  —  1  pi  jl 

^  ii  ;  °i,k+1  +  °ik  (Ckk  '  °k,k 


,k+1 


—  f  1  <■» t  >k— i  _  cl 

^ °ii  ;  i,k+1  ‘  bk+1 ,i 


Than  we  have 


%.vOlX.  ■ 


Ar^  An*  *  *  *~A1 1  A1n1 


s\  *~<2  q2i*1 

u  -52l  ”°21 ••*”°21 

•  1  . 


0  *.1 


which  is  of  the  form  of  (A2.4) . 


So  far  we  have  obtained  the  explicit  expression  for  Fj__]_f0' 
i»l,.-,n.  Based  on  this  result  we  are  able  to  derive  the  time  scale 
deoonposition.  That  is,  we  can  compute 


n— 1 

exp{  £  F^c^t} 
k— 0 


(A2.12) 


as  the  approximation  to  exp{A(e)t}.  In  order  to  prove  Theorem  2.3  we 
need  only  diagonalize  (A2.12) . 


Proof  of  Theorem  2.3: 

Prom  the  special  form  of  of  (A2.5)  and  T”1  as  defined  in 
Theorem  2.3,  we  see  that  the  j-th  column  of  (j2i)  is  nothing  more 
than  the  i-th  column  of  T”1  multiplied  on  the  right  by  j.  Because 
any  row  of  T  except  the  i-th  row  times  i-th  column  of  T"1  must  be  zero 
and  the  i-th  row  of  T  times  i-th  column  of  T"1  is  1,  we  have 


SNV 


\ 


! 

i 

V 

i 

i 

I 

i 


=  diag{0/ » « • / 0/ A^^/ 0/ . • • • / 0 } 

Therefore  we  have 
n-1 

Te*p{£  Pk0eKt}T_i 
k=0 

n-1  ..  . 

=  exp{£  TF^qT-1^} 
k=0 

11-1  v 
=  exp{£  G-je^t} 

k=0 

=  exp{diag[A11,....,enAnn]t} 


But  we  know  that 


lim  sup  I  |exp{A(e)t}  -  exp{  Z]  ^ico6^ '  I 
e->0  t20  k=0 


0 


(A2.14) 


Combining  (A2.13)  and  (A2.14)  we  have 


lim  sup  I  |exp{A(e) t}  -  T^^{diag[Ai;L,. ..^A^tiTl  |=0 
e->0  t20 


which  is  Theorem  2.3. 


I 


t 
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APPENDIX  3 

In  this  appendix  we  describe  some  basic  properties  of  permutation 
matrix. 

Lenina  1  If  T  is  a  permutation  matrix  and  A  is  a  matrix,  then 

1.  TAUT1  is  a  symmetric  permutation  on  A. 

2.  If  TAQ  is  a  symmetric  permutation  on  A  then  T=Q-1. 

EhQQ£- 

1.  T  can  be  expressed  as  product  of  finite  number  of  2  cycles. 
Suppose  T=t ^  *t2  *tjn.  Then  it  is  easy  to  see  that  T_1=tm*tm_ 
;L*...*t2*ti  and  TAT-1  is  a  symmetric  permutation  on  A. 

2.  Since  TQ=TIQ®I  by  definition  of  symmetric  permutation,  T=Q  . 

Lemma  2: 

Any  permutation  matrix  can  be  brought  to  BDC  form  by  similarity 
transformation 

TAT-1  =  diag{C1,C2,...,Cn} 

where  T  is  a  permutation  matrix  and  TAT-1  is  a  symmetric  permutation  on 
A. 

Proof: 

We  already  showed  that  a  permutation  can  be  represented  by  its 
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cycles  and  this  representation  is  unique, 


The  idea  behind  this  proof  is  similar  to  that  of  bringing  a  matrix 
into  its  Jordan  form  where  a  new  basis  which  span  invariant  subspaces  is 
chosen.  A  linear  transformation  T  serves  as  the  basis  transform.  In 
the  new  basis,  the  matrix  will  have  block  diagonal  form. 

A  =  TAT-1  =  diag{J1,....,Jn} 
where  J^  are  Jordan  blocks. 

Similarly,  here  a  simple  reordering  (a  permutation)  will  do.  For 
example,  suppose  A  has  only  one  cycle. 

°A  ”  (x3/xpx2)  (°r  °as(3,1,2)  for  simplicity) 


Here  0A  is  not  a  circulant  permutation.  But  it  is  easy  to  see  that  if 
we  switch  X3  and  X2 ,  then  in  the  new  coordinate  system  0A  will  be  a 
circulant  permutation.  Namely,  if 


&STAT-1 


10  0 
0  0  1 
0  10 


0  0  1 
10  0 
0  10 


10  0 
0  0  1 
0  10 


0  10 
0  0  1 
10  0 


then  0j=(2,l,3)  is  a  circulant  permutation. 

In  general  if  T  is  a  permutation  matrix  which  permutes  and  Xj 
and  S=TAT-1  then  it  is  easy  to  check  that  0A  will  be  the  same  as  0A 
except  x^  and  Xj  have  switched  positions.  We  also  know  that  by  finite 


number,  say  m,  of  switches  of  two  entries,  we  can  "sort"  an  arbitrary  n 
entry  array  in  the  canonical  form.  Suppose  product  of  permutation 
matrices  is  still  a  permutation  matrix,  we  proved  the  lenma  if  there  is 
only  one  cycle. 

If  there  are  several  cycles,  one  can  first  regroup  the  entries  and 
then  use  the  same  schema  as  for  single  cycle. 
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